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Preface 


My  studies  of  optimal  control  theory  and  estimation  problems  prompted 
me  to  take  a further  look  into  improving  solution  techniques  for  matrix 
equations  fundamental  to  those  areas.  This  thesis  is  the  result  of  that 
investigation.  It  would  not  have  been  possible  without  Dr.  John  Jones, 

Jr.,  whose  imagination  and  fertile  mind  are  its  true  source. 

The  work  was  sponsored  by  the  Flight  Control  Division  of  the  Air  Force 
Flight  Dynamics  Laboratory,  Air  Force  Wright  Aeronautical  Laboratories, 
Wright-Patterson  AFB,  Ohio.  I would  particularly  like  to  thank  Captain 
Randall  Gressang  and  Captain  Erie  Lindberg  for  their  personal  support. 

The  results  of  this  paper  should  be  of  great  value  to  the  United  States 
Air  Force  in  the  computation  of  solutions  of  many  diverse  problems  which 
require  solutions  of  the  types  of  equations  treated  in  this  thesis. 

Ms.  Mary  Baker,  my  typist,  deserves  a big  thank  you  for  making 
beautiful  order  out  of  a tangled  jungle  of  matrix  equations--a  truly 
monumental  task. 

My  wife  and  daughter  have  endured  the  most  throughout  this  entire 
AFIT  tour.  Their  patience  and  love  is  without  bound. 
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SOLUTIONS  OF  CERTAIN  ALGEBRAIC  MATRIX  EQUATIONS 

WITH 

APPLICATIONS  TO  ENGINEERING  SCIENCES 


I.  Introduction 


Matrix  equations  are  playing  an  increasingly  important  role  in  the 
solution  of  problems  in  the  engineering  sciences.  Two  of  the  most  widely 
encountered  are 

AX  - XB  = C (1.1) 


often  called  the  Liapunov  equation,  and 


XDX  +AX+XB+C=0 


(1.2) 


known  as  the  algebraic  matrix  Riccati  equation.  New  representations  of 
solutions  of  Eqs  (1.1)  and  (1.2)  along  with  original  techniques  for  com- 
putation of  solutions  and  conditions  for  uniqueness  are  presented  in  this 
thesis.  Included  also  are  treatments  of  more  general  forms  of  these 
equations. 

A growing  interest  in  the  Liapunov  method  of  analyzing  the  stability 
of  both  linear  and  nonlinear  systems  has  meant  commensurate  interest  in 
solving  Eq  (1.1),  as  with  Meirovitch  (Ref  l:Chap  6)  and  LaSalle  and 
Lefschetz  (Ref  2).  The  Liapunov  equation  is  also  the  governing  equation 
in  the  analysis  of  beam  gridworks  with  various  boundary  conditions  as 
studied  by  Ma  (Ref  3) . 
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I 


The  digital  computer  is  responsible  for  the  expanding  use  of  the 
matrix  Riccati  equation,  particularly  in  modern  control  theory,  which  has 
come  into  its  own  during  the  space  program  of  the  last  fifteen  years. 
Control  of  a booster  rocket  during  the  launch  phase  where  random  wind 
loads  require  reaction  from  an  active  control  system  is  a typical  example 
of  the  use  of  modern  control  (Ref  4).  In  fact,  as  R.  Howerton  states, 
"The  key  to  design  of  optimal  controllers  and  estimators  for  discrete 
time- invariant  systems  with  quadratic  cost  criteria  is  the  determination 
of  the  symmetric  positive  definite  solution  of  the  discrete  algebraic 
Riccati  equation"  (Ref  5), 

The  matrix  Riccati  equation  is  an  integral  part  of  the  optimal  esti- 
mation and  filtering  problem  pursued  by  Kalman  (Ref  6) , Deyst  (Ref  7) , 
Sutherland  and  Gelb  (Ref  8),  and  many  others.  Also,  orbital  trajectory 
optimization  has  drawn  a large  number  of  investigations  such  as  those  of 
Bryson  and  Ho  (Ref  9) , Breakwell  (Ref  10) , and  McReynolds  (Ref  11) . 

Applications  of  modern  control  theory  are  being  made  in  many  diverse 
fields.  For  example,  as  the  speed  of  ground  transportation  systems 
increases,  active  control  is  required  to  provide  the  damping  necessary 
for  structural  integrity  and  passenger  comfort  not  possible  with  passive 
control.  Investigation  in  this  field  has  been  accomplished  by  Bender 
(Ref  12) , Karnopp  and  Trikha  (Ref  13) , and  Sevin  and  Pilkey  (Ref  14) . 
Recently  modern  control  theory  has  found  use  in  civil  engineering  struc- 
tures analysis.  J.  Yang  has  shown  that  active  control  systems  can  insure 
higher  structural  reliability  and  safety  in  heavy  structures  by  dampening 
random  inputs  of  earthquakes  and  high  winds  (Ref  15) . 

Modern  control  theory  is  not  the  sole  user  of  the  matrix  Riccati 
equation.  The  positive  definite  solution  of  a matrix  quadratic  equation 
can  replace  the  spectrum  factorization  step  in  the  Wiener  theory  of 


2 


filtering  and  prediction  (Ref  16).  The  Riccati  equation  also  appears  in 
the  theory  of  multiwire  transmission  lines  (Ref  17),  and  Bellman, 
Vasudevan,  and  Ueno  have  shown  that  not  only  the  matrix  quadratic  equa- 
tion, but  higher  order  matrix  equations  as  well,  arise  in  transport 
processes  (Ref  18;  Ref  19). 

Throughout  this  thesis  capital  letters  denote  matrices  and  matrix 
functions,  while  lower  case  letters  represent  scalars  and  scalar  func- 
tions. T is  the  transpose  symbol,  and  I is  the  n by  n identity  matrix. 

The  supplemental  bibliographies  are  an  attempt  to  include  a compre- 
hensive listing  of  articles  which  provide  historical  and  developmental 
background  on  the  matrix  equations  considered  in  this  thesis. 


In  this  chapter  the  results  of  S.  K.  Mitra  (Ref  20)  are  used  to 
extend  the  results  of  W.  E.  Roth  (Ref  21),  J.  Jones,  Jr.  (Ref  22),  and 
others  to  obtain  a general  form  of  the  solution  of 

AX  - XB  = C (2.1 


This  equation  is  called  the  Liapunov  equation  after  Aleksander 
Mikhailovich  Liapunov,  who  lived  in  Russia  from  1857  to  1918.  He  devel- 
oped the  theory  for  and  created  methods  of  solving  problems  in  the  areas 
of  stability,  equilibrium,  and  motion  of  mechanical  systems. 

One  way  for  Eq  (2.1)  to  occur  is  by  taking  a linear,  time -invariant 
system  represented  by  (Ref  23:184) 

x = Ax  (2.2) 

where  x is  the  n state  vector  of  the  system  and  A is  a constant  n by  n 
matrix  which  may  represent  a closed  or  open  loop  system.  To  investigate 


where 


A P + PA  = -Q 


(2.5) 


Employed  in  this  chapter  are  A , defined  for  any  A by  AA  A = A, 
which  replaces  A+  used  in  an  earlier  representation  of  the  solution  of 
Eq  (2.1),  and  the  results  of  W.  E.  Roth  which  established  a necessary 
and  sufficient  condition  for  the  existence  of  a solution  of  the  Liapunov 
equation. 


2.1  Necessary  and  Sufficient  Conditions  for  the  Solution  of  Eg  (2.1) 

a. 

If  the  2n  by  2n  matrices  R and  R are  denoted,  as  in  W.  E.  Roth  (Ref 
21),  by 


A Cl  „ Ta  0 

= R = 

jo  bJ  , [p  b. 


(2.6) 


where  A,  B,  and  C are  given  in  Eq  (2.1)  then  a sufficient,  though  not 
necessary,  condition  that  the  matrices  R and  R be  similar  is,  of  course, 
that  A and  B have  no  common  characteristic  root.  In  this  case  the  solu- 
tion of  Eq  (2.1)  not  only  exists,  but  is  unique.  Here  f^(X)  = | A — XI | 
and  ffi(A)  = lB  -XI  | are  the  characteristic  polynomials  of  A and  B 
respectively.  W.  E.  Roth  (Ref  21)  established  the  following  result: 

Theorem  2.1  - Roth  (Ref  21).  A necessary  and  sufficient  condition 
that  Eq  (2.1),  where  A,  B,  and  C are  square  matrices  of  order  n with  ele- 
ments in  the  field  of  complex  numbers,  have  a solution  X with  elements  in 
the  field  of  complex  numbers,  is  that  the  matrices  given  in  Eq  (2.6)  be 
similar. 

Next,  a solution  X of  Eq  (2.1)  is  obtained  under  the  hypotheses  of 
Theorem  2.1  above.  If  the  following  2n  by  2n  matrices  are  denoted  by 
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m/a 


fA(H  - 


fg(R) 


(2.7) 


where  the  n by  n matrices  U,  M,  M,  and  N are  polynomials  in  the  matrices 
A,  B,  and  C given  in  Eq  (2.1)  and  fA(A)  and  ffi(A)  are  the  characteristic 
polynomials  of  A and  B respectively,  then  the  hypotheses  of  Theorem  2.1 
above  can  be  shown  to  imply  that  there  exists  a solution  X of  Eq  (2.1)  of 
the  form 


X = N M - MU  + N NMU 


(2.8) 


Theorem  2.2.  If  the  matrices  R and  R of  Eq  (2.6)  are  similar,  then 
Eq  (2.1)  has  a solution  X of  the  form  given  in  Eq  (2.8). 

Proof . From  Theorem  2.1,  if  X is  a solution  of  Eq  (2.1),  then  the 
following  equation  results: 

I X]  [o  Ml  |"l  -x]  lb  M + xul  |"l  xl  fl  -xl 

" “ fA(R) 

0 TJ  L°  uJ  L°  ij  L°  u j l°  *J  Ip  ij 

(2.9) 

■fA(A)  0 ] ro  O' 

s = 

0 fA(B)  0 fA(B) 


Equation  (2.9)  implies  that  M + XU  = 0.  Similarly, 

I X N M I -X  N -NX  + M IX  I -X 

fB(R) 

o i J L°  oj  t_o  ij  Lo  o J L°  ij  L°  i 
■fB(A)  0 1 [f  (A)  o’ 

tr  = 

0 f B (B)J  |_  0 0 


(2.10) 


Equation  (2.10)  implies  that  M - NX  = 0.  Therefore,  a solution  X of  Eq 
(2.1)  must  satisfy  the  pair  of  equations 


I. 


M + XU  = 0, 


M - NX  = 0 


(2.11) 


A necessary  and  sufficient  condition  for  NX  = M to  have  a solution 
is  that  NN  M = M,  and  the  general  solution  X^  is  then  given  by 


Xx  = N~M  + Y1  - N~NY1  Y1  arbitrary 


(2.12) 


A necessary  and  sufficient  condition  for  XU  = -M  to  have  a solution 
is  that  MU  U = M,  and  the  general  solution  X2  is  then  given  by 


X2  = -MU  + Y2  - Y2UU 


Y2  arbitrary 


(2.13) 


A necessary  and  sufficient  condition  for  the  pair  of  equations  given 
in  Eq  (2.11)  to  have  a common  solution  is  that  each  equation  individually 
has  a solution,  and  that  -NM  = MU.  A common  solution  X is  given  by  Eq 
(2.8).  A solution  X = X^  = X2  of  Eq  (2.1)  is  of  the  form  in  Eq  (2.8) 
where  = N M and  Y^  = -MU  . 

Other  sufficient  conditions  that  Eq  (2.1)  have  a solution  X with 
elements  in  the  field  of  complex  numbers  follow. 

Theorem  2.3.  If  fa(X)  a polynomial  of  degree  n > 1 in  A with 
coefficients  in  the  field  of  complex  numbers  such  that 


fa(W 


M R = f"A 

L°  °J » L°  B. 


(2.14) 


where  V and  N are  polynomials  in  A,  B,  and  C,  and  V exists,  then  a 
solution  X of  N - VX  = 0 is  also  a solution  of  Eq  (2.1). 

Proof . As  previously  stated  a sufficient,  though  not  necessary, 

'Xj 

condition  that  the  matrices  R and  R be  similar  is,  of  course,  that  A and 
3 have  no  common  characteristic  root.  In  this  case  a solution  of  Eq  (2.1) 
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not  only  exists,  but  is  unique.  But  this  is  also  necessary,  for  the  hypo- 


thesis here  implies  that  fa(B)  = 0,  whence  f^(X)  is  a multiple  of  the 
minimum  polymonial  of  B,  and  that  V = fa(A)  is  nonsingular,  whence  fQ(X) 
cannot  vanish  for  any  characteristic  root  of  A. 

The  matrices  f^(R)  and  R commute,  which  implies  the  following  iden- 
tities : 


AV  = VA, 


AN  - VC  - NB  = 0 


(2.15) 


If  X is  a solution  of  N - VX  = 0,  then,  using  Eq  (2.15),  the  following 


holds : 


0 = A(N  - VX) 


VC  + NB  - AVX 


V(C  + XB  - AX) 


(2.16) 


and  since  V exists  by  hypothesis,  X is  a solution  of  Eq  (2.1). 

Theorem  2.4.  If  f„(X)  is  a polynomial  of  degree  n > 1 in  X with 
coefficients  in  the  field  of  complex  numbers  such  that 


£b(R) 


[::] 


(2.17) 


where  N,  M are  polynomials  in  A,  B,  and  C,and  M exists,  then  a solu- 
tion X of  N + XM  = 0 is  also  a solution  of  Eq  (2.1). 

Proof . As  in  the  proof  of  Theorem  2.3,  a sufficient,  though  not 
necessary,  condition  that  the  matrices  R and  R be  similar  is,  of  course, 
that  A and  B have  no  common  characteristic  root.  In  this  case  a solution 
of  Eq  (2.1)  not  only  exists,  but  is  unique.  But  this  is  also  necessary, 

for  the  hypothesis  here  implies  that  f„(A)  = 0,  whence  f (X ) is  a roulti- 

p p 

pie  of  the  minimum  polynomial  of  A and  that  M = f^(B)  is  nonsingular, 
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whence  f^(X)  cannot  vanish  for  any  characteristic  root  of  B. 

The  matrices  of  f^(R)  and  R commute  which  implies  the  following 
identities : 


BM  * MB, 


AN  + CM  - NB  = 0 


(2.18) 


If  X is  a solution  of  N + XM  = 0,  then, using  Eq  (2.18),  the  following 


holds: 


0 = -(N  + XM)B 


-NB  - XMB 


-CM  - AN  - XMB 


-CM  + AXM  - XMB 


(-C  + AX  - XB)M 


(2.19) 


and  since  M exists  by  hypothesis,  X is  a solution  of  Eq  (2.1). 

Example  2.1.  Solving  Eq  (2.1)  with  the  above  technique  is  now 
illustrated  for 


n 


[::l 


(2.20) 


|1  0 1 3 


10  1 2 


0 0 0 -1 


(2.21) 


0 0 0 -1 


|R  _ XI I = -(X2  - X) (X2  + X)  = -fA(X)fB(X) 


(2.22) 
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fA  (R)  = R - R 
A 


ffi(R)  = R + R = 


0 0 0 -4 


OOO  -2 


0 0 0 2 


0 0 0 2 


2 0 2 2 


2 0 2 2 


0 0 0 0 


0 0 0 0 


N M 

.0  0 


[1/2  0]  [0  1/2 

U = 

1/2  Oj,  L°  i/2 


Using  Eq  (2.8)  to  find  X 


[1/2  O']  [*2  2l  fO  -4*|  fO  1/2 

1/2  oj  L2  2J  L°  "2J  IP  !/2_ 

[1/2  O]  [2  0]  b -4l  [O  1/2 

1/2  oj  [2  oj  [O  ~2J  L°  1/2 

u 


which  is  a solution  of  Eq  (2.1), 


2.2  Properties  of  Sets  of  Solutions  of  Eq  (2.1) 


(2.24) 


(2.25) 


(2.26) 


(2.27) 


In  this  section  some  properties  of  solutions  of  Eq  (2.1)  are  estab- 


lished. 


Theorem  2.5.  If  X,,  X_,  . . . , X are  solutions  of  Eq  (2.1),  then 
1 2 n 

— (X,  + X_  + • • • + X ) is  also  a solution  of  Eq  (2.1). 
n 1 2 n 

Proof.  If  X, , X.,  ...»  X are  solutions  of  Eq  (2.1),  then 
12  n 
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X1B  = C 


x2b  = C 


Adding  the  above  equations  gives 


A(X.  + X0  + • • • 4 X ) 
1 Z n 


X B = C 
n 


(xx  + x2  + 


(2.28) 


X )B  = nC  (2.29) 
n 


A(X.  + X.  + • • • + X ) 
l /.  n 


(X.  + X.  + • • • X )B  = C 
i z n 


(2.30) 


Theorem  2.6.  If  X^,  Xg*  • • . » X are  solutions  of  Eq  (2.1)  and 
n n a^ 

\ = m,  then  \ — X^  is  also  a solution  of  Eq  (2.1). 

i=l  i=l 

Proof.  If  X^,  X2>  . . . , X^  are  solutions  of  Eq  (2.1),  then 


OjAXj^ 


“A" = “1C 


a2AX2 


a^B  = a2C 


(2.31) 


a AX 
n n 


a X B = a C 
n n n 


Adding  the  above  equations  gives 


A(a1X1  + a.X.  + • • • + a X ) - (a.X.  + a„X„  + • • • + o X )B  = mC  (2.32) 

± ± z * nn  li  z z nn 


* 


ii 


III.  Solutions  of  the  Matrix  Equation  £ A XB  = C 

i=l 

Under  More  General  Conditions  on  A and  B 


In  this  chapter  solutions  of  matrix  equations  of  the  form 


l a.xb  = c 

i=l  1 1 


(3.1) 


are  obtained  under  rather  general  conditions  on  the  matrices  A^,  B^,  and 

C.  For  the  case  where  A^,  B^,  and  C are  square  matrices  with  elements 

belonging  to  the  field  of  complex  numbers,  solutions  are  obtained  in 

terms  of  C and  the  principal  idempotent  and  nilpotent  matrices  associated 
1c  1c 

with  and  {B^J.j^  (Ref  24:Chap  16).  Solutions  are  found  for 

certain  important  special  cases  of  the  general  linear  matrix  equation 
such  as  the  Liapunov  equation. 

If  A is  a n by  n matrix  having  elements  belonging  to  the  field  of 

complex  numbers  and  if  the  reduced  characteristic  function  <1> ( A)  factors 

into  linear  factors, 

v v v m 

<KX)  = (X  - a.)  J-(X  - a.)  . . . (X  - a ) S,  s > 2,  £ v < n (3.2) 

1 Z S i=l  1 


where  the  {a^}  are  the  distinct  characteristic  roots  of  A,  then  A has  the 
following  representation: 


A - l (a  A + A ) , 2 m £ n 

■»=i  J J J 


(3.3) 


where  the  matrices  (Aj),  { A^ } form  a complete  set  of  principal  idempotent 
and  nilpotent  matrices  with  the  following  properties: 


then  a solution  X of 


is  given  by 


XB  = D 


n DF. 

xB  = l -r1 
^ 3-1  b3 


n DF 

Proof.  If  X„  = y — r^-  then 

8 3-1  b3 


/ n DF.\  n DF 

v ■(,!,  4 ■ ■,!,  t; 


n DF 

l T1  (blFl  + b2F2  + 

j=l  Dj  1 Z Z 


n _ n 

l DF  = l DF.  = DI  = D 
j=l  J j=l  J 


+ b F ) 
n n 


Theorem  3.3 


II  II 

. if  a = y a e. ant*  ® = I b.F.  w®ere 

i=l  i“l  J 3 


a1  t 0, 


Ei  = Ei’ 


bj  f 0 
F ^ = F 

i 1 


EiEj  = FiFj  = °»  1 * i 

n n 

y e = i,  1 f = 1 

i=l  j=l  3 


then  a solution  X of  AXB  = G is  given  by 

x „?  l _dFj 

AB  i=i  j»i  aibj 
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Now  by  multiplying  both  sides  of  Eq  (3.17)  on  the  left  by  A 


, n n E.CF  A n n E.CF. 

I l iV  - l MiV 

1 i=i  j=i  ai  M i=i  j=i  ai  bj 


n n 

I l 

i=l  j=l 


l l (alEl  + “2“2 


a„E„  + 


+ a E ) 


EiCFi 


n n a^-b^ 


(3.19) 


n n a.E.  CF.  n n a,E_,CF. 

- I l ~ TT1*  l l AV1 

i-i  j-i  ai  bj  i-i  j-i  ai  bj 


Then  forming  the  following  differences  gives 

n n a. E.CF.  n n E.CF.b. 

“a-b-*a-bb-  I I -iV-  l l 4- qA 

A B i=l  j=l  i j i=l  j=l  i bj 
n n a.ICI  n n ICIb. 

■ l l -l  l 

i=l  j=l  ai  bj  i=l  j=l  ai  bj 
n n a.C  n n Cb^ 

■ i l jV  - s i rV 

i=l  j=l  ai  bj  i=l  j=l  ai  bj 


n n (a.-b.)C 


l l 


LZA1 


i=l  j=l  ai"bj 


= C 


(3.20) 


Example  3.1.  Equation  (2.1)  is  used  to  demonstrate  the  techniques 
just  presented  for 


3 -7  -20* 

’ 1 -3  3' 

'l  1 1 

A = 

0 -5  -14 

B = 

0 3-2 

C = 

-4  -6  0 

0 3 8 

> 

-i  -1  1_ 

9 

2 3 0 

(3.21) 


where 


J A-XI | = ( A— 1) (X-2) (X-3) 


(3.22) 


3 

-7 

-2o" 

0 

-11/2 

-11 

A = 

0 

-5 

-14 

= 1 

0 

7 

14 

_0 

3 

8_ 

_0 

-3 

- 6 
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I * 


2 

I B<  = 


J-l 


-3  -6  3 

2 4-2 

12-1 


(3.26) 


Next  It  is  seen  that 


l B • = 1 A.  | C = 

j-l  V \k=l  k 


111 
-4  -6  0 

2 3 0 


-3  -6  3 

2 4-2 

12-1 


= 0 (3.27) 


and  a^  - 1,  a^  - 2,  a^  = 3,  = 2,  and  = 1.  Therefore, 


^al~^2^  = ^a2-kp  = and 


18 


1 


A1CB2 


A2CB1  " 


18  42 

-6  14 


111 
-4  -6  0 

2 3 0 

" 1 11 
-4  -6  0 

2 3 0 


2 -3  0 

2 3 0 

2 3 0 

3 3 O" 

2 -2  0 
2 -3  1 


Finally,  a solution  X of  AX  - XB  = C is  given  by 


3 2 (A,  CB  . ) 

*-  l l uVt 

k=i  j=i  uk  V 


= -A, CB,  + A_CB„  + 1/2  A_CB„  + A,CB 


11  2 2 


3 2 3 1 


0 11/2  ll|  f 1 1 i|  f 3 3 


-14  -4  -6  0 -2  -2 


18 

42 

-6 

-14 

2 3 0J  \-2  -3  1J 

1 1 ll  f-2  -3  0l 


4 -6  0 2 3 0 + 

2 3 0 2 3 0 


2/4  -25/4  -31/2 1 \ 1 1 lj  [- 2 -3 


1 -25/2  -31 

0 0 0 

0 0 0 

0 0 o' 

0 0 0 

0 0 0 


•4  -6  0 2 3 0 + 

2 3 Oj  2 3 0_ 

1 1 l]  f 3 3 dl 


-4  -6  0 -2  -2 


.2  3 Oj  L-2  -3  1J 

12  18  61 


0 + -4  -6  0 + 


n 


-6  -17/2  -1 


3 6-3 


0 0 0 


O'  -2  -4  2 = 0 0 0 = 


-2  -3  0 -1  -2  11  0 0 0 


and,  as  a check. 


f 5 13/2  il 


f 6 15/2  3] 


(3.33) 


II  t'-lf-O 


i 


3.2.  Solution  of  the  General  Linear  Case  of  Eg  (3.1) 


Theorem  3.5.  If  A^,  A^,  B^,  B^,  and  C are  n by  n matrices 


with 


elements  in  the  field  of  complex  numbers,  and  if  A^  and  B^  have  the 


following  representations: 


Ai  = I (aij  Aij  + Afj^»  1 ~ 1>  2;  1 i nf  < n 


Bi  = (bik  Bik  + V’  i = 1.  2;  1 < n.  < n 


with 


A£j  C CB^  i * 1>  2;  j - 1,  2,  . . . , n^ 


k 1,2,  . . « , n . 


and 


aijx  ^ + a2j2  b2k2  ° 


whenever 


Aiji  A2j2  CBlk1  B2k2  0 


CAij  " Aij  C 


B..  C = CB,, 
ik  ik 


then 

has  a solution  X given  by 


AIXB1  + A2XB2  = C 


ni  no  ni  n?  Ao  j CB,,  B_,  \ 

I l I l v 1J1  2J2  lkl  2kl) 


Jj-1  V*  VI  VI  (alj1  blkx  + a2j2  b2k2) 


Proof.  If  X is  given  by  Eq  (3.39),  then 
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(3.34) 


(3.35) 


(3.36) 


(3.37) 


(3.38) 


(3.39) 


A 


'•••—» 


A1XB1  + A2XB2 

y fAl  AU!  A2j2  C*l\  B2k2  B1  ^ A2  Alj1  A2j2  CBlkl  B2k2  B2 

(°«1  + ‘^2  “V  + (S«1  tlkl  * b2k2) 

k— kl— ni 


iiJil-i 


,alXAlA 


°1 

* Ai>)(A131A?J2CBlk1B2k2)(j1bi>1Bi>1  k »u) 


<alJx  S + a2J2  b2k2> 


n2  n2 

(ix  *»,  *2,  ^ ^ s xXi, 


b_  Bo  + B. 
2p  2u  2y, 


(alj1  blkx  + a2j2  b2k2} 


f al j ^Ik  Alj  A2 j 2CBlk1B2k2  a2j ?b2k2Al j 1 A2 j 2CBlkl B2k, 

S • V * • 4,  V 


h— ki— ni 


1 (Ali 
L<j..<nJ  iJl 


A2j  2 CBlk1  B2k2 


2— ki— ni 


Therefore,  X given  by  Eq  (3.39)  is  a solution  of  Eq  (3.38). 

If  {Ai>,  {Bi>,  and  C have  the  representations  listed  in  section  3.1 


CAij  = V - 0 - BikC  = CBik 

AijC  = CAij 


BikC  = CBik 


then  Eq  (3.1)  has  a solution  X with  elements  in  the  field  of  complex 


numbers  given  by 
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nl  n2 


X-  I l 


V1  V1 


"h  "l  n2 

l l l 

v1  Jr1  v2 


Alix  A2i2 


• A,  , CB. . B-.  • • • B, . 

hih  lkl  2k2  hkh 


22 “ h 1 2 

alj1  blkx  + • * * + b 


(3. 42) 


whenever 


3lj1  blkl  + a2j2  b2k2  + * * * + ahjh  X = 0 


(3.43) 


implies  that 


Ali  A2i?  ‘ - ' Ahi,  CBlk  B2k  * • * Bhk,  " 0 (3*A4) 

1 £ h 1 2 h 


The  proof  of  the  above  theorem  is  similar  to  those  in  the  earlier  sec- 


tions. 
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IV.  Solutions  of  AX  = C Under  More  General  Conditions  on  A and  C 


The  main  purpose  of  this  chapter  is  to  establish  necessary  condi- 
tions and  sufficient  conditions  for  the  existence  of  a solution  X of  the 
matrix  equation 


AX  = C 


(4.1) 


where  A,  X,  and  C are  n by  n matrices  having  elements  belonging  to  a 
polynomial  domain ^)(x)  where  Qis  the  field  of  real  numbers.  Such  equa- 
tions arise  in  the  work  of  B.  Porter  (Ref  25)  and  others. 


4.1  A Necessary  Condition  for  the  Existence  of  a Solution  X of  Eq  (4.1) 
Theorem  4.1.  If  A and  C are  n by  n matrices  having  elements  belong- 
ing to^)(x),  and  if  X is  a n by  n matrix  with  elements  in^(x)  which 
satisfies  Eq  (4.1),  then  the  following  2n  by  2n  matrices  are  equivalent: 


(4.2) 


[A  Cl  |A  O' 

0 ij,  |_0  I 

Proof . If  X is  a solution  of  Eq  (4.1)  with  the  elements  of  X 
belonging  toQ(x),  then 

II  -AXl  [A  cl  Tl  ol  [a  c-ax“|  [a  o' 

0 10  I_  _0  I_  _0  I _ 0 I 


and  the  matrices  of  Eq  (4.2)  are  equivalent. 


4.2  A Sufficient  Condition  for  the  Existence  of  a Solution  X of  Eg  (4.1) 
Theorem  4.2.  If  A and  C are  n by  n matrices  with  elements  belonging 


(4.3) 
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r~  ' 


toKJ(x) , and  the  following  2n  by  2n  matrices  are  equivalent 


A C 
0 A 


A 0 
0 A 


(4.4) 


then  there  exists  a n by  n matrix  X with  elements  belonging  to  \7(x)  which 
satisfies  Eq  (4.1). 

Proof.  There  exist  nonsingular  n by  n matrices  P and  Q with  ele- 
ments belonging  to<^(x)  such  that 


PAQ  A diag  {a^,  ^22*  * * * * 0»...»0)  (4.5) 


where  a^,  i = 1,  2,  ...  a are  the  invariant  factors  of  A,  i.e.,  a^ 


divides  a^ , i £ j,  i,  j = 1,  2, 


, a.  If  the  following  definitions 


exist 


P o]  [A  C 
0 Pj  [o  A_ 


fp  ol  fA  o' 


0 PJ  JO  Aj  |_0  Q 


Q o' 

0 0 
• m 

Q O' 


PA  PC 
0 PA 
PA  0 ' 
0 PA 


Q 0 

L°  Q 

Q O' 
o Q 


PAQ  PCQ 
_ 0 PAQ 
PAQ  0 ' 
0 PAQ 


A'  C’ 
0 A' 
A'  O' 
0 A’ 


* M 


(4.6) 


where  PAQ  = A’  and  PCQ  = C',  then  M is  equivalent  to  N,  since  the  matri- 
ces given  in  Eq  (4.4)  are  assumed  to  be  equivalent. 

It  will  be  shown  that  there  exists  a matrix  U with  elements  belong- 
ing to  \)(x)  such  that 


A’U  = C' 


(4.7) 


or,  by  elements, 


n * u = c * 

a nuij  c ij 


(4.8) 


* 
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where  a' 


€ A',  c' 


• » 


n. 


A1  € A',  c'y  € C',  and  u^  € U,  where  i,  j = 1,  2,  . , 
and  a'^  ° a^  given  in  Eq  (4.5).  This  will  be  accomplished  by  showing 


that  c'ij  is  a multiple  of  a'^  for  i,  j = 1,  2, 
c'^j  will  be  considered  in  four  separate  cases. 
The  2n  by  2n  matrix  M is  given  below  by 


, n.  The  elements 


(4.10) 


Case  1.  First  the  elements  c'^,  i,  j « 1,  2,  . . . , a will  be 
considered.  Now  if  a'^  and  a'^j  have  the  greatest  common  factor  g^ 
(g^j  ■ it:  can  he  shovm  that  c’^j  is  its  multiple  (of  a'^  - g^) 

and  as  a result  Eq  (4.8)  is  again  valid  in  \?(x) . If  g is  any  factor 


irreducible  inW(x)  which  is  a factor  of  a'^,  then  the  invariant  factors 
of  A are 


, _ i- 

3 ii  " 8 aii* 


rl  -r2  - * * * -ro 


(4.11) 


where  a^  is  a polynomial  in\7(x)  and  is  relatively  prime  to  g.  Conse- 
quently the  invariant  factors  of  M (of  N)  are 


k , 

“kk  = 8 m kk’ 


t <_  t2  i.  • • • 1 t 


(4.12) 


where  t^,  k = 1,  2,  . . . 2a,  is  a permutation  of  the  exponents  r^,  i = 1, 
2,  . . . 2a,  in  nondescending  order  and  where  m'^  is  prime  to  g. 

The  (i+j-l)th  determinantal  divisor  of  M (and  of  N)  is 


M = TTi+j-Vkm. 

Mi+j-l  nk=l  8 m kk 


(4.13) 


ri  ri 

It  contains  the  factor  g of  a'^.  or  the  factor  g J of  a'jj  for  in  fann- 
ing the  sequence  of  integers  {t^,  k = 1,  2,  . . . , 2i-l}  = {t},  either 
all  (r  , r.,  ...»  r } or  all  (r^,  r 2,  . . . , r.}  must  be  used  in 

X i l J r-  r 

i j 

order  to  get  its  i+j-1  terms.  On  the  other  hand  both  g and  g cannot 

be  factors  of  M.,,  , for  both  r.  and  r.  cannot  occur  in  the  sequence  {t} 
i+j-1  i J 

which  has  only  i+j-1  terms.  Neither  r^  nor  r is  zero  since  a'^  and  a'^ 

are  not  relatively  prime  unless  a' ^ = 1.  Consequently  the  minimum 

^il 

{r^r^}  * t^  belongs  to  { t } and  ^ must  have  the  factor  g . Now 

it  is  assumed  that  M'  and  N’  are  the  matrices  obtained  from  M and  N 


respectively  by  deleting  their  ith  and  (n+j)th  rows  and  columns,  and  that 

N ^ the  common  (i+j-2)th  determinantal  divisor.  N'  will  not  have 

ii 

the  factor  g J since  the  rows  and  columns  containing  a’^  and  a'jj  ware 
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deleted  in  forming  M'  and  N'  but  will  contain  as  factors  all  the  remain- 


ing powers  of  g that  occur  in 

Now  c'jjNi+j  2 is  a minor  °f  order  i+j-1  of  N and  as  a consequence 
is  a multiple  of 


_i+J-l  k 
nk=l  8 


(4.14) 


t.. 

which  is  a factor  of  Therefore,  c 1 ^ must  be  a multiple  of  g J , 

the  highest  power  of  g which  is  common  to  a'^  and  a'jj-  Since  g is  any 
factor  irreducible  in  sD(x)  and  a factor  of  a*  , and  since  t is  the 

VAtA  1J 

greatest  power  of  g common  to  a'^  and  a'jj»  it  follows  that  the  greatest 
common  factor,  g„  (g^  = a'^)  of  a'^..  and  a'jj  is  a divisor  of  c'^. 
Consequently  Eqs  (4.8)  are  satisfied  by  elements  u^  in,'3(x)  where  1 <_  i 
a,  1 < j < a. 

Case  2.  The  block  of  elements  c'^,  1 i <_  a,  a < j <^n  is  now 
examined . Here 


o’  TI^~^a,  na  a'  na  a' 
c ij  h=l  hh  h-i+1  hh  k=l  kk 


(4.15) 


is  a minor  of  order  2a  of  N;  it  must  also  be  a multiple  of 


7° 

h=l“  hh“k=la  kk 


(4.16) 


which  is  the  (2a) th  determinantal  divisor  of  M (of  N) . That  is,  c' 


ij 


must  be  a multiple  of  a’^  and  Eqs  (4.8)  are  satisfied  for  Case  2 by 


'ij 


(4.17) 


ii 


since  a’^j  may  be  regarded  as  identically  equal  to  zero  where  a < j n. 
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- . - - 


Case  3.  Similarly  Eqs  (4.8)  are  valid  for  the  block  of  elements 


c'^j,  a<i£n,  l£j£a.  A similar  type  proof  as  in  Case  2 holds. 
Case  4.  Finally, 


c'ij  * °’ 


a < i £ n,  a < j < n 


(4.18) 


otherwise  N would  have  the  nonzero  minor 


c'  n“  .a'  , n“  a ' 
ij  h=l  hh  k=l  kk 


(4.19) 


of  order  2ct+l,  which  is  impossible.  Hence,  c' ^ is  identically  zero  and 
any  elements  in  'Q(x)  will  satisfy  Eqs  (4.8)  since  both  a'^  and  a'jj 
may  be  regarded  as  identically  zero  in  case  a < i <_  n and  a < j n. 
Equations  (4.8)  are,  therefore,  valid  for  i,  j = 1,  2,  . . . , n. 

Therefore,  U = {u^.}  is  a n by  n matrix  with  elements  which 

satisfy  Eq  (4.7).  Now  A'  = PAQ  and  C'  = PCQ  and  since  P and  Q are  non- 
singular, Eq  (4.7)  reduces  to 


[PAQ]U  = [PCQ] 

A[QUQ_1]  = C (4.20) 

and  X = QUQ-1 

with  elements  in  £l(x)  such  that  Eq  (4.1)  is  satisfied  and  the  theorem  is 
proved . 

Example  4.1.  The  following  example  illustrates  Theorem  4.2  for  the 
matrix  equation 


AX  = C 


(4.21) 


where 


,~‘+  -v'-yjFHjp'ip*.  ^ — 


-1  1+(A-1) (A+l)  0 

1 -1- (A-l) (A+l)  A(A-l)2(A+l)(A+2)4 


A2(A-l)(A+l)2(A+2)5  0 


-1  A2-A2(A-1)(A+1)  0 

1 -A2-A2(A-1)(A+1)  A ( A-l) 2 (A+l) ( A+2) 5 


By  using 


(4.22) 


-A3 (A-l) 3 (A+l) 2 (A+2) 5 0 


(4.23) 


1 0 0 0 0 


110  0 0 


0 0 0 1 0 


.0  0 0 0 1. 


1 1 0 0 0l 


0 10  0 0 


0 1 1 0 0 1 and  Q = 1 0 0 1 0 0 


0 0 0 1 0 


.0  0 0 0 ll 


(4.24) 


A’  - PAQ 


0 (A-l) (A+l) 


A(A-l)2(A+l)(A+2)4 


A 2 (A-l) 3 (A+l) 2 (A+2) 5 0 


(4.25) 
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0 A (A-l) (A+l) 


0 


0 


C' 


= PCQ  = 


0 

0 

.0 


0 

0 

0 


A (A-l) 2 (A+l) (A+2) 5 0 

0 -A3 (A-l) 3 (A+l) 2 (A+2) 

0 0 

(4 


Taking  A'  and  C'  to  form  M 


1 0 


0 


0 0 | 


0 (A-l) (A+l) 

0 0 

0 0 


0 

A(A-1)2(A+1) (A+2)4 


0 0 | 

0 0 I 


0 


A 2 (A-l) 3 (A+l) 2 (A+2) 5 0 I 


■ 
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V.  Solutions  of  Riccati  Matrix  Equations 

Of  all  nonlinear  differential  equations,  the  Riccati  differential 
equation 

y' (z)  = f Q ( z ) + fx(z)y(z)  + f2(z)[y(z)]2  (5.1) 

can  claim  to  be  the  most  important.  It  has  a close  connection  with  the 
second  order  linear  case,  moveable  singularities  which  appear  as  simple 
poles,  and  a general  solution  that  is  a fractional  linear  function  of  the 
constant  of  integration  (Ref  26:273).  Count  Jacopo  Francesco  Riccati 
lived  in  Italy  from  1676  to  1754.  He  studied  Newton's  theories  of  gra- 
vitation and  published  works  on  philosphy,  differential  equations,  and 
mensuration  physics.  His  work  with  the  Riccati  equation  culminated  in 
1724  when  he  published  solutions  to  several  special  cases.  This  work  was 
done  before  the  introduction  of  matrices,  however,  and  the  term  matrix 
Riccati  equation  remains  somewhat  of  a misnomer. 

One  way  the  matrix  Riccati  equation  originates  is  the  following 
(Ref  23:237-239): 

A time  varying  system  represented  by 

x = Ax  + Bu  (5.2) 

is  associated  with  a linear  regulator  problem  with  quadratic  performance 
criterion 

1/2xTMx  + 1/2/q^[xTQx  + uTRu]dt  (5.3) 

The  Hamiltonian  is 
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u 


I i 


H = l/2x^Qx  + l/2u^Ru  + p^(Ax  + Bu) 


(5.4) 


OH 


and  for  optimality  — = 0 so  that 


- -1  T- 

u*  = -R  Bp* 


(5.5) 


The  adjoint  equations  are 


p*  = -Qx*  - A p* 


(5.6) 


and  by  substitution 


- - -1  T- 

x*  = Ax*  - BR  Bp* 


(5.7) 


The  solution  of  Eq  (5.6)  can  be  written  in  terms  of  a transition  matrix 
P: 


p*  = Px* 


(5.8) 


To  determine  P,  Eq  (5.8)  is  differentiated  to  get 


Px*  4 Px*  - p*  = 0 


(5.9) 


and  by  substituting  for  x*,  p*,  and  p* 


(P  + PA  - PBR'Vp  + Q 4 ATP)x*  = 0 


(5.10) 


Since  Eq  (5.10)  must  be  valid  for  all  time  0 £t  — ^ must  satisfy 

(5.11) 


-IT  T 

P = PBR  B P - A P - PA  - Q 


This  is  the  form  called  the  matrix  Riccati  equation  and  the  steady  state 
(P  = 0)  form  is  referred  to  as  the  algebraic  matrix  Riccati  equation. 

The  algebraic  matrix  Riccati  equation  in  the  form 


XDX  4 AX  4 XB  4 C = 0 


(5.12) 
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where  A,  B,  C,  and  D are  n by  n matrices  having  elements  belonging  to  the 
field  of  complex  numbers  is  considered  in  this  chapter.  Results  of  W.  E. 
Roth  (Ref  27)  are  extended.  The  coefficients  of  all  polynomials  which 
arise  in  this  chapter  belong  to  the  field  of  complex  numbers.  Therefore, 
the  similarity  of  matrices  and  the  reduceability  of  polynomials  remain 
valid  under  the  rational  operations  of  the  field. 


5.1.  Necessary  Conditions  for  the  Existence  of  Solutions  of  Eg  (5.12) 


If  the  3n  by  3n  matrix  R is  given  by 


'V 

R = 


-BOD 
0 10 
-C  0 A 


(5.13) 


and  if  X is  any  solution  of  Eq  (5.12)  with  elements  belonging  to  the 
field  of  complex  numbers,  then  the  following  eight  theorems  result: 

Theorem  5.1.  If  X is  any  solution  of  Eq  (5.12),  then  the  following 
matrices 


R = 


-BOD 
0 10 
-C  0 A 


XD+A 

0 

(-XB-C)  + (XD+A)  (-X)' 

XD+A 

0 

0 

0 

I 

0 

= 

0 

I 

0 

D 

0 

-DX-B 

D 

0 

-DX-B_ 

(5.14) 


'V 

<v 

= R 


are  similar,  and  there  exists  at  least  one  triple  of  polynomials  fa(X)  of 


degree  a n,  g^(X)  of  degree  6 <_  n,  and  h^(X)  of  degree  y £ n with 


% *\j  *\j 

coefficients  in  the  field  of  complex  numbers,  such  that  f^(R) *gg(R) *h^(R) 


0,  where  fa(X) *gg(X) *h^ (X)  is  not  necessarily  the  minimum  polynomial 


II 
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satisfied  by  R but  is  a divisor  of  |R  - XI | such  that  fa(XD  + A)  = 0, 
gB(I)  = 0,  and  h (-B  - DX)  = 0. 

Proof.  If  X is  a solution  of  Eq  (5.12),  then 


-B  0 D 

0 0 i" 

O 

M 

o 

oio 

-C  0 A 

I 0 -X 

'V 

= R 


(5.15) 

XD+A  0 (-XB-C)+(XD+A) (-X) 

0 1 0 

D 0 -DX-B 

k- 

Therefore,  R and  R are  similar.  Now  [ R — XI j = | XD  + A - XI | • | I - XI | • 

| -DX  - B - XI | . If  f£(X)  = |R  - XI | = f0(A)*gg(X)*h  (A)  then  0 = 

f^  (XD  + A) *gg(I) 'h^(-DX  - B)  and  at  least  one  polynomial  must  vanish. 

Polynomials  f (X),  gD(X),  h (X)  of  degree  a<n,  B<n,  Y<n 
a p v — — — 


respectively  such  that  f a (X ) 'gg(X) *h  (A)  is  a divisor  of  | R — X I | and  a 
multiple  of  the  minimum  polynomial  satisfied  by  R are  called  an  admissi- 
ble class  of  polynomials  and  are  denoted  by_r^(R) . 

Theorem  5.2  (Ref  28).  If  ffl(A)  €^(R)  such  that  fQ(X^D  + A)  = 0 
where  X^  is  any  solution  of  Eq  (5.12),  and  if 


% 

f (R)  = 
a ' J 


U 0 M 
P Q R 
V S N 


(5.16) 


where  R is  given  in  Eq  (5.14)  and  where  U,  V,  M,  N,  0,  P,  Q,  R,  and  S are 
polynomials  in  the  matrices  A,  B,  C,  and  D,  then  the  following  equations: 


XjU  + V = 0, 


XjM  + N = 0, 


XjO  + S = 0 


(5.17) 
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have  a common  solution  X^  which  also  satisfies  Eq  (5.12).  In  addition, 
the  following  holds: 

(X1  0 I)fa(R)  =(000)  (5.18) 

Proof.  If  X^  is  a solution  of  Eq  (5.12)  and  if  fa(R)  is  given  by 
Eq  (5.16),  then  using  Eq  (5.15), 


0 

1 

0 

0 

I ‘ 

fa(XlI>fA)  0 

0 

0 

I 

0 

f (ft) 

a 

0 

I 

0 

= 

* f (I) 

a 

0 

I 

0 

0 

I 

0 

**  *** 

f (-DX.-B) 
a 1 

XjM-N  X^O+S  (X1U+V)+(X1M+N)  (-X^ 
R Q P-RX1 

M 0 U-MX1 


where  *,  **,  ***  denote  polynomials  in  A,  B,  C,  and  D.  Since  f^(X^D  + A) 
= 0 then  XjM  + N = 0,  X^O  + S = 0,  and  X^U  + V = 0 by  matching  0 compo- 
nents of  Eq  (5.19).  Equation  (5.18)  follows  immediately. 

Theorem  5.3.  If  h^(-DX2  - B)  = 0 such  that  h^(A)  €_^(^)  where  X2  is 
any  solution  of  Eq  (5.12),  and  if 


f\j 

\(R) 


U 0 M 
P Q R 
Lv  S Nj 


(5.20) 


'X,  — — — — = — _ — _ 

where  R is  given  in  Eq  (5.14)  and  where  U,  V,  M,  N,  0,  P,  Q,  R,  and  S are 
polynomials  in  the  matrices  A,  B,  C,  and  D,  then  the  following  equations: 

u - MX2  = °»  V - NX2  = 0,  P - RX2  - 0 (5.21) 

have  a common  solution  X2  which  also  satisfies  Eq  (5.12),  In  addition, 
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the  following  holds: 


'X/  p 

\(r)  r i 


o=o 


(5.22) 


Proof . If  X2  is  a solution  of  Eq  (5.12)  and  if  h^(R)  is  given  by 
Eq  (5.20),  then  using  Eq  (5.15), 


x2  0 I 


0 0 1 hY(X2IH-A)  0 


0 10  h_^(R)  0 10  = 


I 0 0 


I 0 -X. 


hY(I)  0 

***  hy(-DX2-B) 


X2MfN  X20+S  (X2U+V)+(X2M+N) (-X2) 


(5.23) 


where  *,  **,  ***  denote  polynomials  in  A,  B,  C,  and  D.  Since  h^-DX^B) 
= 0 then  U - MX2  = 0,  P - RX2  = 0 by  matching  0 components  of  Eq  (5.23). 
Now  U = MX2  and  (X2U  + V)  + (X,,M  + N)  (-X21  = 0 implies  X2U  + V -X2MX2  - 
NX2  = 0 implies  X2U  + V - X2U  - NX2  = 0 implies  V - NX2  = 0.  Equation 
(5.22)  follows  immediately. 

Theorem  5.4.  If  gg(A)  €_rf(R)  such  that  g^(I)  = 0 where  X3  is  any 
solution  of  Eq  (5.12),  and  if 


U 0 M 


gg(R)  = P Q R 
V S N 


(5.24) 


where  R is  given  in  Eq  (5.14)  and  where  U,  V,  M,  N,  0,  P,  Q,  R,  and  S are 
polynomials  in  the  matrices  A,  B,  C,  and  D,  then  the  following  equations: 
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Proof.  If  X satisfies  Eqs  (5.28),  then 


XOI  UOM  00  I 

010  PQR  01  0 

100  VSN  10  -X 


XM+N  XO+S  (XU+V)+(XM+N) (-X) 
R Q P-RX 

M 0 U-MX 


(5.30) 


If  X^  is  a solution  of  the  first  equation  of  Eq  (5.28)  and  Xj  is  a solu- 
tion of  the  second  equation  of  Eq  (5.28),  then  the  matrices  in  Eq  (5.29) 
are  similar. 

The  following  identities  implied  by  Rfa(R)  = fa(R)R  are  developed 
now  for  later  use  in  this  chapter: 


-BOD  UOM  UOM  -BOD 

010  PQR  = PQR  010 

-COA  VSN  VSN  -COA 


(5.31) 


-BU+DV 

-BO+DS 

-BM+DN 

'-UB-MC 

0 

UD+MA* 

P 

Q 

R 

= 

-PB-RC 

Q 

PD+RA 

-CU+AV 

-CO+AS 

-CM+AN 

-VB-NC 

S 

VD+NA 

(5.32) 


-BU  + DV  = -UB  - MC,  -BO  + DS  = 0,  -BM  + DN  = UD  + MA 

P = -PB  - RC,  Q = Q,  R = PD  + RA  (5.33) 

-CU  + AV  = -VB  - NC,  -CO  + AS  = S,  -CM  + AN  = VD  + NA 

Next,  solutions  of  the  following  pair  of  equations  are  related 


(X  0 I)fa(R)  = (0  0 0) 


(5.34) 


fa(R)  I 0 
0 = 0 


(5.35) 
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(X  0 I)[u  0 Ml 


V S N 


U 0 f 1*1  fo 


V S N -X 


XU  + V = 0 


U - MX 


xo  + s 


XM  + N = 0 


P - RX 


V - NX 


(5.37) 


Theorem  5.6.  If  Xx  is  a solution  of  Eq  (5.34)  and  f ^ (X)  €_^f(R)  such 


U 0 M 


fa(R)  = P Q R 
V S N 


(5.38) 


and  M is  nonsingular,  then 


X2D  + A = M (-DX^^  - B)M 


(5.39) 


It  will  be  shown  that  a unique  solution  of  Eq  (5.35)  exists  and  that 


U - mx2  = 0, 


V - nx2  = 0 


(5.40) 


Also,  if  X1  is  a solution  of  Eq  (5.34),  it  will  be  shown  that 


XjU  + V = 0, 


XjM  + N = 0 


(5.41) 


Proof.  Using  XjM  + N = 0,  X^U  + V = 0,  U - MX2  = 0,  and  V - NX2  = 0 
it  follows  that  X2  = M U and  X2D  = M ^UD.  Next,  from  the  identities  in 
Eq  (5.33) 
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X2D  = m“  (DN  - MA  - BM) 

= M_1  DN  - A - M_1BM 
= M~1(-DX1M)  - M_1BM  - A 
X2D  + A = M_1(-DX1  - B)M 


(5.42) 


Equation  (5.39)  follows. 


Theorem  5.7.  If  Xx  is  a solution  of  Eq  (5.34)  and  fa(A)  ^4(R)  such 


U 0 M 


fa(R)  = P Q R 
V S N 


(5.43) 


and  U is  nonsingular  then 


DX^  + B = -M(C  + X2B)U- 


(5.44) 


Proof.  If  U exists,  then  ^ = -VU-1  and  DXX  = -DVU-1,  then 

DXX  = -(BU  - MC  - UB)U-1 
= -(B  - MCU-1  - UBU'1) 

= ~(B  - MCU-1  - MX2BU-1) 

DX1  + B = M(C  + X2B)U_1 


(5.45) 


Theorem  5.8.  If  Xx  is  a solution  of  Eq  (5.34)  and  fa(A)  €_~?(R)  such 


U 0 M 


fa(R)  = P Q R 
V S N 


(5.46) 
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and  N is  nonsingular  then 


-1. 


i2D  + A = -N  (AX2  + C)M 


(5.47) 


Proof.  If  N * exists  then  V = NX^  and 


X2D  = N-1VD 

= N'"1(AN  - NA  - CM) 

= N_1AN  - A - N_1CM  (5.48) 

= -n“1ax]m  - A - n"1cm 
X2D  + A = -N_1(AX1  + C)M 


5.2.  Sufficient  Conditions  for  the  Existence  of  Solutions  of  Eg  (5.12) 

In  this  section  f is  a complex -valued  function  of  a complex  variable 
which  is  defined  on  the  spectrum  of  R such  that  Rf(R)  = f(R)R  (Ref  27), 
where 


% 

f(R)  = 


"u 

0 

m" 

-B 

0 

D 

p 

Q 

R 

II 

0 

I 

0 

V 

s 

N_ 

> 

-C 

0 

A 

(5.49) 


Theorem  5.9.  If  M ^ or  N ^ exists  and  X is  a solution  of 


f(R) 


d 


(5.50) 


where  R exists,  then  the  pair  of  equations 


A + XD  = I 
AX  + XB  + C + XDX  = 0 


(5.51) 
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Ij 

i 

) 


has  X as  a common  solution  given  by 

X = (I  - A)D~  + Z - ZDD* 

for  any  arbitrary  Z (Ref  20) . 

Proof. 


*\j 

f(R) 


(5.52) 


’ I* 

"u 

0 

m" 

' I* 

o“ 

0 

= 

p 

Q 

R 

0 

= 

0 

_-x_ 

_v 

s 

N 

-X 

0 

= 0, 

P - 

RX 

s 

0, 

V 

(5.53) 


(5.54) 


Using  R = PD  + RA  from  the  matrix  identities  in  Eq  (5.33)  and  P = RX  from 
Eq  (5.54) 


-R  + PD  + RA  = 0 
-R  + RXD  + RA  = 0 
Rf-I  + XD  + A]  =0 


(5.55) 


Since  R exists 


0 = -I  + XD  + A 

1 = A + XD 

X = (I  - A)D~  + Z - ZDD* 


(5.56) 


and  using  V - NX  = 0 from  Eq  (5.54)  plus  the  matrix  identities  in  Eq 
(5.33) 

0 =>  D(V  - NX) 

= DV  - DNX 

= (-UB  + BU  - MC)  - (UD  + MA  + BM)X 


(5.57) 
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= (-MXB  + BMX  - MC)  - (MXD  + MA  + BM)X 
= -MXB  + BMX  - MC  - MXDX  - MAX  - BMX 


-M(XB  + AX  + C + XDX) 


Since  M exists  this  implies  that  X is  a solution  of  Eq  (5.12).  Simi- 
larly, 


0 = A(V  - NX) 


= AV  - ANX 


= (-VB  - NC  + CU)  - (VD  + NA  + CM)X 


(5.58) 


= -NXB  - NC  + CMX  - NXDX  - NAX  - CMX 
= -N(XB  + AX  + C + XDX) 


Since  N exists  this  implies  that  X is  a solution  of  Eq  (5.12). 
Theorem  5.10.  If  M ^ or  U ^ exist  and  X is  a solution  of 


(v  0 I)f(R)  =(000) 


(5.59) 


where  0 exists,  then  the  pair  of  equations 


-I  = B + DX 


0=AX+XB+C+  XDX 


(5.60) 


has  X as  a common  solution  given  by 


X = D~(-I  -3)  + W - D~D 


(5.61) 


for  any  arbitrary  W (Ref  20) . 


Proof. 


(X  0 I)f(R)  = (X  0 I)  |U  0 Ml 


P Q R = (0  0 0) 


(5.62) 


V S N 
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0 = -(XM  + N)C 

= -xmc  - nc 

= -XUB  - XMC  + XUB  - NC 
= -X(UB  + MC)  + XUB  - NC 
= X(DV  - BU)  + XUB  - NC 
= XDV  - XBU  + XUB  - NC 

= -XDXU  - XBU  + XUB  - NC 

= -XDXU  - XBU  - AXU  + AXU  + XUB  - NC 

= -XDXU  - XBU  - AXU  - AV  - VB  - NC 

= -XDXU  - XBU  - AXU  - CU 

= -(XDX  + XB  + AX  + C)U 


(5.67) 


.-1 


Since  U exists  this  implies  that  X is  a solution  of  Eq  (5.12). 


5.3.  Uniqueness  Theorems 

Theorem  5.11.  If  the  following  conditions  hold 

(i)  DA  = BD 

(ii)  (-A  ± I)  = (-A  + I)D_D 

(iii)  (-B  ± I)  = DD~(-B  ± I) 

(iv)  Sufficient  conditions  that  XDX  - 2/3X  + 1/3C  = 0 have  a 

solution 


then 


0 = XDX  + AX  + XB  + C 
A = -XD  ± I 
B = -DX  ± I 


(5.68) 

(5.69) 

(5.70) 


have  a unique  solution  X.  Such  a solution  also  satisfies 


XDX  - 2/3X  + 1/3C  = 0 


(5.71) 


46 


Proof . Conditions  (i) , (ii) , and  (iii)  from  Mitra  (Ref  20)  insure 
the  Eqs  (5.69)  and  (5.70).  Condition  (iv)  is  satisfied  by  Theorems  5.9 


and  5.10.  In  addition,  if  and  X2  are  solutions  of  Eq  (5.68)  where 


X1  f X2,  then 


AX^  + XjB  + C + XjDXj^  = 0 


ax2  + x£b  + c + x2dx2  = 0 


subtracting, 


A(XX  - X2)  + (Xx  - X2)B  + X1DX1  - X2DX2  = 0 


A(X1  - X2)  + (Xx  - X2)B  + X1DX1  - X2DX1  + X2DX1  - X2DX2  = 0 

A(xx  - x2)  + (x1  - x2)b  + (x1  - x2)dxx  + x2d(x1  - x2)  = 0 

(A  + X2D)(X1  - X2)  + (X1  - X2)(B  + DXj_)  = 0 


Since 


A + XD  = ± I 


B + DX  = ± I 


±i(xx  - x2)  + (xx  - x2)(±i)  = 0 


ZX^^  = 2X2 


X1  = X2 


which  contradicts  the  hypothesis  and,  therefore,  the  solution  is  unique. 
Theorem  5.12.  If  M 1 or  N 1 exists  and  X is  a solution  of 


f (tori' 


0 = 0 


-X  0 


where  R exists,  then  the  pair  of  equations 


A + XD  = I 


(5.77) 


AX  + XA  + C + XDX  = 0 


(5.78) 


has  X as  a common  solution.  In  optimal  control  theory  where  D is  posi- 

T T 

tive  definite,  D = D , and  X = X , such  a solution  is  unique.  Also,  the 

T 

optimal  control  theory  notation  corresponds  to  A = A and  A = B. 

Proof.  By  Theorem  5.11,  Eqs  (5.77)  and  (5.78)  have  X as  a common 
solution.  If  X^  and  X^  are  solutions  to  the  Riccati  equation  where  X^  ^ 


A X1  + XjA  + XjDXj^  + C = 0 


A X2  + X2A  + X2DX2  + C = 0 


subtracting, 


A (XL  - X2)  + (Xx  - X2)A  + XjDXj^  - X2DX2  = 0 


A (X.  - X2)  + (Xx  - X2)A  + X1DX1  - X1DX2  + X1DX2  - X2DX2  = 0 

T ( 

A (X1  - X2)  + (Xx  - X2)A  + X1D(X]_  - X2)  + (X1  - X2)DX2  = 0 

(AT  + x1d)(x1  - x2)  + (x2  - x2)(a  + dx2)  = 0 


from  Eq  (5.77),  AT  + XD  = I and  since  D = DT  and  X = XT, 


(AT  + xd)t  = IT 
A + DTXT  = I 


A + DX  = I 


which  implies 


i(x1  - x2)  + (xx  - x2)i  = 0 


X1  - X2  + X1  - X2  = ° 


2X1  = 2X2 


X1  " X2 
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which  contradicts  tlie  hypothesis  and,  therefore,  only  one  unique  solution 


exists. 


Theorem  5.13.  If  M ^ or  U ^ exists  and  X is  solution  of 


(I  0 X)f(K)  =(000) 


(5.83) 


where  0 exists,  then  the  pair  of  equations 


A + DX  = -I 


(5.84) 


A X + XA  + C + XDX  = 0 


(5.85) 


has  X as  a common  solution.  In  optimal  control  theory  where  D is  posi- 

T T 

tive  definite,  D = D , and  X = X , such  a solution  is  unique. 

Proof . By  Theorem  5.11,  Eqs  (5.84)  and  (5.85)  have  X as  a common 
solution.  If  and  X 2 are  solutions  to  the  Riccati  equation  where  X^  i4 
X2,  and  by  using  Eqs  (5.79)  and  (5.80), 


(A  + X1D)(X1  - X2)  + (Xx  - X2)(A  + DX2)  = 0 


(5.86) 


from  Eq  (5.84)  A + DX  = -I  and  since  D = DT  and  X = X^, 


T T 
(A  + DX)  = -I 

T T T 
A + X D = -I 


(5.87) 


A + XD  = -I 


which  implies 


-IO^  - x2)  + (xx  - x2)(-i) 


X2  - X1  + x2  - Xi  = 0 


2X2  = 2Xl 


(5.88) 


x2  = xx 
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which  contradicts  the  hypothesis  and,  therefore,  only  one  unique  solution 
exists. 


5 


I 

t 


Example  5.1.  An  example  from  Martensson  (Ref  29:23)  is  used  to 
show  the  computational  technique  presented  in  the  previous  sections. 
Martensson's  equation  was  written 


ATX  + XA  - XBQ2  1BTX  + = 0 


(5.89) 


where 


at  = 


-3  -2 

-3  2 

o' 

A = 

B = 

. 2 K 

9 

-2  1 

9 

1 

B4  = [0  1], 


Ql  = 


0 0 
0 0 


Q2  = [1] 


Rewriting  in  the  form  of  Eq  (5.12)  where 


-1„T 


D = -BQ2  B = 


o' 

'o  o' 

[1] to  1]  = 

1. 

0 -1 

'o  o' 

-3  -2 

-3  2 

*0  0" 

0 o' 

X 

X + 

X + X 

+ 

= 

0 -1 

. 2 1 

-2  1 

o' 

°l 

o o 

-B  D 
-C  A 


-3-2  0 0 

2-1  0-1 
0 0-3-2 

0 0 2 1 


] R-AI | = (X-l)2(X+i)2  = 0 
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(5.90) 


(5.91) 


(5.92) 


(5.93) 


(5.94) 


i 


TABLE  I 

Combinations  of  Characteristic  Roots  of  R 


1 

-1 

1 

(x-i)(x-i) 

(x-i)(x+i) 

-1 

(X+1KA-1) 

(X+l) (X+l) 

Table  I shows  three  combinations  of  the  characteristic  roots  of  R to  be 
checked: 


(i)  Case  1 

(ii)  Case  2 

(iii)  Case  3 


R - 2R  + I 
R2  - I 


R + 2R  + I 


Case  1.  R - 2R  + I 


5 

-4 

0 

2 

-6 

4 

0 

0* 

1 

0 

0 

o" 

4 

-3 

-2 

0 

+ 

-4 

2 

0 

2 

+ 

0 

1 

0 

0 

0 

0 

5 

4 

0 

0 

6 

4 

0 

0 

1 

0 

0 

0 

-4 

-3 

0 

0 

-4 

-2 

0 

0 

0 

1 

- 

0 0 0 2 

0 0-2  2 

0 0 12  8 

00-8-4 


U M 
V N 


X = -NM 


-1 


12  8“ 

1/2 

-1/ 2 

-10  6' 

-8  -4_ 

1/2 

0_ 

. 6 -4. 

(5.95) 


(5.96) 


Case  2.  R - I 
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' “ -T  ' ■ ” - - 


5-402 
4-3-2  0 

0 0 5 4 

0 0-4-3 

4 -4  0 2 

4-4-2  0 

0 0 4 4 

0 0-4-4 


10  0 0 
0 10  0 
0 0 10 
0 0 0 1 


U M 
V N 


(5.9 


X = M_1U  = 


' 0 

-1/2 

4 -4' 

-2  2 

1/2 

0 _ 

_4  -4_ 

_ 2 -2_ 

(5.9 


also. 


X = -NM 


'4  4' 

‘ 0 

-1/2' 

-2  2 

_4  -4 

1/2 

0 _ 

. 2 ~l 

(5.9 


Case  3.  The  results  are  the  same  as  Case  1. 


Both  solutions  can  be  shown  to  satisfy  Eqs  (5.89)  and  (5.92). 
Example  5.2.  This  example,  taken  from  Denman  and  Beavers  (Ref  30 
69),  shows  several  solutions  not  found  by  other  techniques. 


o' 

Oj 

0 1 

4 O' 

o o' 

A = 

B = 

C = 

D = 

_1  -2_ 

9 

0 -2_ 

* 

0 1_ 

9 

0 -1 

(5. 1C 


R = 


-B  D 
-C  A 


0-100 
0 2 0 -1 

-4  0 0 0 

0-11-2 


(5.10 


|R-AI|  = (A2— 4) ( X 2— 1)  = 0 


(5.1( 


All  solutions  can  be  shown  to  satisfy  Eq  (5.12). 

Example  5.3.  The  techniques  of  this  chapter  can  also  be  used  to 
solve  the  Liapunov  equation  by  setting  D equal  to  the  zero  matrix.  The 
following  example  is  taken  from  Hagander  (Ref  31). 


-3  O' 

B = 

3 

o' 

C = 

1 

o' 

D = 

0 

o' 

.0  -2_ 

9 

0 

-2_ 

9 

0 

1_ 

9 

0 

0_ 

(5.111) 


R = 


-B  D 
-C  A 


3 

0 

-1 

0 


0 0 0 

2 0 0 

0-3  0 

-1  0 -2 


(5.112) 


| R-Al | = (A+3) (A-3) (A+2) (A— 2)  = 0 


(5.113) 


TABLE  III 

Combinations  of  Characteristic  Roots  of  R 


2 

-2 

3 

-3 

2 

(A-2)2 

(A-2) (A+2) 

(A-2) (A-3) 

(A-2) (A+3) 

-2 

(A+2) (A-2) 

(A+2) 2 

(A+2) (A-3) 

(A+2) (A+3) 

3 

(A-3) (A-2) 

(A-3) (A+2) 

(A-3)2 

(A-3) (A+3) 

-3 

(A+3) (A-2) 

(A+3) (A+2) 

(A+3) (A-3) 

(A+3)2 

k 

^ . 


Table  III  shows  there  are  ten  cases  to  consider,  however,  there  are  solu- 
tions for  only  two  cases  in  this  example: 

Case  L R2  + 5R  + 61 


'9 

0 

0 

o' 

15 

0 

0 

o" 

6 

0 

0 

o' 

0 

4 

0 

0 

+ 

0 

10 

0 

0 

+ 

0 

6 

0 

0 

0 

0 

9 

0 

-5 

0 

-15 

0 

0 

0 

6 

0 

0 

0 

0 

4_ 

_ 0 

-5 

0 

-10_ 

0 

0 

0 

b 
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— — 1 — — BBZ 


30  0 0 0 

0 20  0 0 
-5  0 0 0 

0-500 


U M 
V N 


-1 


X = -vu 


Case  2.  R - 5R  + 61 


5 0 
0 5 


1/30  0 

0 1/20 


1/6  0 1 

. 0 IMJ 


0 0 0 0 

0 0 0 0 

5 0 30  0 

0 5 0 20 


U M 
V N 


X = N 


= 


1 — — 

o 

o 

CO 

i*H 

*5  O' 

1/6 

o' 

L 0 1/2°. 

o 5. 

_ 0 

1/4 

(5.114) 


(5.115) 


(5.116) 


(5.117) 


These  solutions  can  be  shown  to  satisfy  Eq  (2.1).  Liapunov  equa- 
tions will  always  have  D as  a zero  matrix  which  will  always  produce  a 
zero  matrix  M.  Therefore,  as  illustrated  above,  the  only  two  possible 
combinations  which  produce  solutions  are  X = -VU  and  X = N V. 


5.4  A Possible  Iterative  Solution  Technique 

In  this  section  a complex-valued  function  f(z)  of  a complex  variable 
z defined  on  the  spectrum  of  R is  used  as  in  P . Lancaster  (Ref  32).  The 
following  form  of  Eq  (5.12)  is  used  in  this  section: 

AX  + XB  = C + XDX  (5.118) 

Theorem  5.14.  If  f(z)  = (z  + a)(z  -a)"1  where  a t 0 and  a is  real, 
and  if 
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(5.119) 


f (A)  ~ (al  - A)-1 (al  + A)  = U 


f(B)  " (al  + B) (al  - B)_1  - V 


(5.120) 


where  U and  V are  defined  as  in  P.  Lancaster  (Ref  32)  such  that  (al  - A) 
and  (al  - B)  ^ exist,  then  a solution  of 

X = UXV  - (U  - I)C(V  - I)  (U  - I)XDX(V  - I)  (5.121) 

la  la 


is  also  a solution  of  Eq  (5.118). 


Proof . Manipulation  of  Eq  (5.118)  gives 


AX  + XB  = C + XDX 


2aAX  + 2aXB  = 2aC  + 2aXDX,  (where  a 4 0 and  a is  real) 

a2X  + aXB  + aAX  + AXB  - a2X  + aXB  + aAX  - AXB  = 2aC  + 2aXDX(5.122) 

aX(al  + B)  + AX(aI  + B)  - aX(al  - B)  + AX(al  - B)  = 2aC  + 2aXDX 

(al  + A)X(aI  + B)  - (al  - A)X(aI  - B)  = 2aC  + 2aXDX 

If  a is  chosen  so  that  (al  - A)  ^ and  (al  - B)  ^ exist,  then  multiplying 
on  the  left  and  right  gives 

(al  - A)-1 (al  + A)X(aI  + B)  (al  - B)_1  - 
(al  - A)_1(al  - A)X(aI  - B) (al  - B)"1 
= 2a (al  - A)-1C(aI  - B)-1  + 2a(al  - A)-1XDX(aI  - B)-1  (5.123) 


Using  Eqs  (5.119)  and  (5.120) 

UXV  - X = 2a(al  - A)_1C(aI  - B)”1  + 2a(al  - A)_1XDX(aI  - B)-1  (5.124) 


= (z  + a)  , _ (z  + a)  -1  z + 

(z  - a)  1 (z  - a)  1 


z + a 


2a(z  - a) 


f (A)  - I = 2a(A  - al) 


(5.125) 


U - I = 2a (A  - al)  1 jg  (U  - I)  = (A  - al)"1  (5.126) 

~ (V  - I)  = (B  - al)"1  (5.127) 

X - UXV  = -2a(al  - A)-1C(aI  - B)"1  -2a(al  - A)_1XDX(aI  - B)"1  (5.128) 

where  (A  - al)"1  = -(al  - A)"1  (5.129) 

X - UXV  = (U  - 1)C[-  ~^-(V  - I)]  + (U  - I)XDX[-  -|^(V  - I)]  (5.130) 

X - UXV  = - |^(U  - I)C(V  - I)  - (U  - I)XDX(V  - I)  (5.131) 

X = UXV  - i-(U  - I)C(V  - I)  - ~(U  - I)XDX(V  - I)  (5.132) 
2a  2a 

If  the  conditions  of  Theorem  5.14  hold,  then  Eq  (5.121)  can  be  used 
to  obtain  a recursion  formula; 

X , . = UX  V - — -(U  - I)C(V  - I)  - ^-(u  - I)X  DX  (V  - 1)  (5.133) 

n+1  n la  la  n n 


which  can  be  used  to  find  the  solution  of  Eq  (5.118)  using  a suitable 
norm  as  an  initial  guess.  For  a good  initial  choice  of  the  following 
example  shows  that  by  using  Eq  (5.133)  it  may  be  possible  to  develop  a 
useful  algorithm  for  computing  the  solution  of  the  original  Riccati  equa- 
tion. 

Example  5.4.  Another  example  from  Martensson  (Ref  28:44)  is  used  to 
illustrate  the  recursion  formula  of  Eq  (5.133),  Rewriting  Martensson' s 
equation  in  the  form  of  Eq  (5.118)  gives: 


0 1 

0 1 

-l  l 

0 0 

X + X 

a 

+ X 

.1  °- 

_i  o 

. 1 -1 

p \ 
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r 


and 


A - XI  = X -1-0 


A = ± 1 


Choosing  a = 0.1  yields  the  following: 


-1 


and 


U - I = 2a (A  - al)  = .2 


V - I = 


9.9 


.1  1 

1 -.1 

'.1  1 

1 .1 


n-1 


2 

9.9 


1 1 

1 .1 


For  an  initial  guess  of 


X0  = 


4 2 
2 2 


Eq  (5.133)  produces 


5.07 

3.40~ 

' .17 

-.17' 

-.99 

-.99' 

— 

3.40 

3 . 07_ 

+ 

-.17 

. 17_ 

+ 

-.99 

-.99 

r\ 

_ .“1 

4.25  2.24 

2.24  2.28 


4.36 

2.36' 

and 

x2  = 

,2.36 

2.42_ 

(5.135) 

(5.136) 


(5.137) 


(5.138) 


(5.139) 


(5.140) 


(5.141) 


After  two  iterations  X compares  favorably  to  Martensson's  solution: 


4.41  2.41 

2.41  2.41 


(5.142) 


Further  investigation  of  Eq  (5.121)  should  yield  more  specific  results. 
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VI.  Solutions  of  the  Generalized  Matrix  Riccati  Equation 


r 


t 


The  generalized  form  of  the  matrix  Riccati  equation  arising  in 
transport  theory,  as  given  in  a series  of  papers  by  Bellman  and  Vasudevan 
(Ref  18;  Ref  19),  is  considered  in  this  chapter.  The  usual  matrix 
Riccati  equation  can  be  related  to  the  transport  of  a beam  of  particles 
through  a medium  of  finite  length  with  conservative  type  of  interactions. 
By  varying  the  picture  of  the  interactions  in  the  medium,  the  above 
authors  arrive  at  extensions  of  the  Riccati  equation  with  higher  order 
nonlinear  terms.  They  made  use  of  a linearization  of  these  generalized 
equations . 

The  form  of  the  Riccati  equation 

AX  - XB  = C + XDX  + XEXFX  (6.1) 

treated  in  this  chapter  can  be  derived  as  follows: 

If  one  considers  a system  of  linear  vector  equations 


-Z'  = AZ  + DXEY 

(6.2) 

and 

Y'  = BZ  + CY 

(6.3) 

with  boundary  conditions 


Z(a)  = C 

Y (0)  = 0 


(6.4) 


then,  from  the  system,  Eqs  (6.2)  and  (6.3),  the  goal  is  finding  Y(a), 
namely 


Y (a)  = X(a)Z(a) 


(6.5) 
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where  7.(a)  is  nonsingular. 

First,  a differential  equation  involving  X(a)  is  obtained  by  differ- 
entiating Eq  (6.5)  with  respect  to  a and  making  use  of  Eqs  (6.2)  and 
(6.3): 

Y'  = X'Z  + XZ' 

Y'  = X'Z  + X(-AZ  - DXEY) 

Y'  = X'Z  - XAZ  - XDXEY 

(6.6) 

BZ  + CY  = X'Z  - XAZ  - XDXEY 

BZ  + CXZ  = X'Z  - XAZ  - XDXEXZ 

therefore,  X’  = B + CX  + XA  + XDXEX 

The  last  of  Eqs  (6.6)  can  be  written  as 


X'  = B + CX  + XA  + XDX  + XEXFX 


(6.7) 


In  this  chapter  sufficient  conditions  for  the  existence  of  a solution  of 
the  nonlinear  equation  associated  with  the  last  of  Eqs  (6.6)  are  esta- 
blished. 


6.1.  A Necessary  Condition  for  the  Existence  of  Solutions  of  Eq  (6.1) 
Theorem  6.1.  If  X is  a solution  of  the  matrix  differential  equa- 
tion 


— + XB  - AX  + XDX  + XEXFX  + C = 0 
at 


then  the  following  matrices 


A 

dt 

XDX  + XEXFX  + C 

A 

o' 

_0 

B 

» 

0 

B_ 

are  similar. 


(6.8) 


(6.9) 
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Proof.  If  X is  a solution  of  Eq  (6.8),  then 


IX  A 77+XDX 
at 


0 I 0 


A — + XDX 
at 


+ XEXFX  + c]  fl  -X* 

B J [o  I 

+ XEXFX  + C + XBl  fl  -X* 

B J [o 

A -AX  + ~ + XDX  + XEXFX  + C + XbI  Ta  0" 
at 

B J [O  B_ 


and  the  matrices  of  Eq  (6.9)  are  seen  to  be  similar. 


6.2.  Sufficient  Conditions  for  the  Existence  of  Solutions  of  Eq  (6.1) 


In  this  section  four  theorems  concerning  sufficient  conditions  for 
solutions  to  the  matrix  differential  equation  and  the  algebraic  equation 
are  established.  These  results  extend  those  of  W.  Roth  (Ref  27)  and 
R.  Bellman  (Ref  18;  Ref  19). 

Theorem  6.2.  A sufficient  condition  for  the  algebraic  matrix 
Riccati  equation 


AX  - XB  = C + XDXEX 


to  have  a solution  is  that  the  equation 


[A  - A I ] X ( A ) - Y (A) [B  - AI]  = C + XDXEX 


have  a solution  X(A),  Y(A)  where 


X(A)  = X„  + AX,  + A2x0  + • • • + APX 
(J  i l p 


Y(A)  = Y + AY,  + A2Y„  + • • • + APY 

U 1 L p 


a r\ 

X = X.  + X,B  + X-B  + • • • + X BP 

0 12  p 


1 


Proof . Rewriting  F.q  (6.12)  using  Eqs  (6.13)  and  (6.14)  gives  the 
following  equation: 


[A  - XI] [XQ  + XX2  + X X2  + • • • + XpXp]  - 


[YQ  + XYX  + X Y2  + 


AAA 

+ XpYp][B  - XI]  = C + XDXEX  (6.16) 


Equating  coefficients  of  like  powers  of  X and  multiplying  the  equations 
on  the  right  by  I,  B,  B2,  . . . , BP+3_  gives  the  following  set  of  equa- 
tions : 


AXq 

-YoB 

= C + XDXEX 

AX  B 

-xoB 

-V2 

+ Y0B 

= 0 

ax2b2 

- V2 

3 

- Y2BJ 

+ YXB2 

= 0 

ax3b3 

3 

- x2bj 

-V4 

+ y2b3 

= 0 

AX  . BP_1 

- X „BP  1 

- Y BP 

+ y 

p-1 

P-2 

p-1 

p-2 

AX  BP 

- X . BP 

- Y BP+1 

+ y 

P 

p-1 

P 

p-i 

+ Y BP+1  = 0 


(6.17) 


Columnwise  addition  gives 


[AXQ  + AX^2  + AX2B3  + . . • + AXp_1BP  1 + AXpBP]  - 

[X„B  + X,B2  + X»B3  + • • • + X ,BP  + X BP+1]  = C + XDXEX  (6.18) 

01  2 p-1  p 
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I kJ- k 


Finally, 


A[Xq  + X1B  + X2B  + 


+ X ,BP_1  + X BP]  - 

p-1  p 


•y  _ 1 _ a a ^ 

[X„  + X, B + XnB  + • • • + X ,BP  + X BP]B  = C + XDXEX  (6.19) 

01  2 p-1  p 


and  so 


AX  - XB  = C + XDXEX 


(6.20) 


holds. 


Theorem  6.3.  A sufficient  condition  for  Eq  (6.11)  to  have  a solu- 
tion is  that  the  equation 


[A  - AI]X(A)  - Y(A) [B  - XI]  = C + YDYEY 


(6.21) 


have  solutions  X(A) , Y(A)  where  Eqs  (6.13),  (6.14)  and 


Y = Yq  + AYX  + A Y2  + 


+ APY 


(6.22) 


hold. 


Proof . Rewriting  Eq  (6.21)  using  Eqs  (6.13)  and  (6.14)  produces  the 
following  equation: 


[A  - AI] [XQ  + AX1  + A X2  + • • • + ApXp]  - 
[YQ  + AY  + A2Y 2 + • • • + APYp][B  - AI]  = C + YDYEY  (6.23) 


Equating  coefficients  of  like  powers  of  A and  multiplying  the  equations 
on  the  left  by  I,  A,  A2,  ....  AP+1  gives  the  following  set  of  equa- 
tions : 


t 

I 


I 


[AY  + A2Y-  + A3Y„  +•««+  APY  . + AP+1Y  ] - 

0 12  p-1  p 

[YqB  + AYXB  + A2Y2B  + • • • + AP_1Yp_1B  + APYpB]  = C + YDYEY  (6.25) 
Finally, 

A[Y  + AY.  + A2Y_  + • • • + AP_1Y  , + APY  ] - 

01  2 p-1  p 

[Y  + AY  + A2Y-  + • • • + AP-1Y  , + APY  ]B  = C + YDYEY  (6.26) 
u 1 z p—±  p 

and  so 

AY  - YB  = C + YDYEY  (6.27) 

holds. 

Example  6.1.  An  illustration  of  the  above  theorem  is  given  by  the 
following  one  dimensional  example: 
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-•-  '-I  ™ i • ™ww  r-r-~  ’-i  ”,  rwT  ■' ■> 


2y  = 1 + y 


(6.28) 


which  can  be  written  as 


[3]  [y]  - [y]  [1]  = [1]  + [y]  [1]  [y] 


(6.29) 


where 


A =*  3,  B = 1 C = 1,  D =*  1 


(6.30) 


The  equation 


[A  - AIJX(A)  - Y(A) [B  - AIJ  - C + Y-D-Y 


(6.31) 


has  solution 


Y - Yo  * 1 


(6.32) 


Simplifying  the  above  results  in  the  following: 


[A  - AI]X(A)  - 1 [B  - AI]  - 1 + 1.1*1 
[3  - A ] X ( A ) - 1(1  - A]  - 2 

[3  - A ] X ( A ) = 2 + 1 - A 
X(»  -f-TT-  1 


(6.33) 


and  1 is  easily  seen  to  be  the  solution  of  Eq  (6.28). 

Theorem  6.4.  A sufficient  condition  that  the  matrix  differential 
Riccati  Equation 


AX  - XB  = C + XDXEX  + 

at 


(6.34) 


have  a solution  is  that  the  equation 


[A  - AI]X(A,t)  - Y ( A , t ) [ B - AI]  = C + XDXEX  + 


(6.35) 


have  a solution  X(A,t),  Y(A,t)  where 
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(6.36) 


X(A,t)  = XQ  + tX±\  + t2X2A2  + t3X3A3  + • • • 

Y(A,t)  = YQ  + tY^A  + t2Y2A2  + t3Y3A3  + . . . 

X(t)  = XQ  + X1Bt  + X2B2t2  + X3B3t3  + • • • 


Proof.  Rewriting  Eq  (6.35)  using  Eqs  (6.36)  and  (6.37)  produces  the 
following  equation: 

[A  - XI] [X  + tX  A + t2X2A2  +...]- 
[Yq  + tYjA  + t2Y2A2  + • • • ] [B  - AI]  = C + XDXEX  + (6.39) 

Equating  coefficients  of  like  powers  of  A and  multiplying  the  following 
equations  on  the  right  by  I,  B,  B2,  . . . , gives  the  following  set  of 
equations : 


^0 

-v 

= C 

AXjtB 

-xoB 

- YxB2t 

+ Y0B 

= 0 

2 2 

AX2t  B 

- XjtB2 

- Y2B3t2 

+ Y1tB2 

= 0 

3 3 

AX3tJB 

- X2t2B3 

- Y3B4t3 

+ Y2t2B3 

= 0 

(6.40) 


AX  tV  - X tP  V - Y BP+1tP  + Y tP_1BP  = 0 
P P-1  p p-1 


- X tPBP+1 


+ Y tPBP+1  = 0 


Columnwise  addition  gives 


[AX0  + AXtB  + AX2tV  + . . . + AXptPBP 
[XqB  + XjtB2  + X2t2B3  + • • . + XptPBP+1  + • • • ] = C + XDXEX  + ^ (6.41) 


I 


J 


A[X_  + X, Bt  + X„B2t2  + • • • + X BPtP  + •♦•)- 

0 12  p 


9 9 n n A * a jv 

[X„  + X. Bt  + X„B  t + • • • + X BPtl  + • • * ]B  = C + XDXEX  + — (6.42) 

0 12  p at 


and  finally. 


AX(t)  - 


X(t)B  = C + X(t)DX(t)EX(t)  + d-^^~ 


Example  6.2.  The  preceding  theorem  is  illustrated  by  the  following 


scalar  Riccati  differential  equation; 


[ l ] x ( t ) - x(t) [-1]  = [l]  + x(t)[i]x(t)  + 


where  A = 1 , B = -1 , C=l,  D = 1 


The  equation 


[A  - XI]X(t,X)  - Y (t , X) [B  - XI]  = 


C + XDXEX  + 

at 


has  a solution  X(X,t),  Y(X,t)  given  below; 


2 3 4 

Y(X , t)  = 1 + ^X  - -^X2  + V3  " V4  + V5 
L C C C C 


= YQ  + YXX  + Y2X2  + Y3X3  + 


2 3 4 

. i.l-i  t .2  t . 3 t_,4  t ,5 

X(X,t)  - 1 + X - _x  + _x  — ,x  + _x 

L c c c c 


= XQ  + X.X  + X2X2  + X3X3  + 


Then,  by  Eq  (6.38) , 


2 3 4 5 

X(t)  = l+^-II  + i3--L4+i5-“6+- 
L C CJ  C C C 


XQ  + X^Bt  + X2B2t2  + X3B3t3  + • • • 
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^ — — — 


■ 

i 

i 


■ 


and 


dX(t) 

dt 


i_+  2t 

2 3 

C CJ 


dX„  dX. 


3t 2 4t 3 

c4  c5 


5t 4 + 6t 5 + 

c6  c; 


u i ^^2  2 2 ^3  3 3 

dr  + dTBt  + dTBt  +dTBt  + * 


(6.49) 


and  Eq  (6.44)  has  a solution  X(t)  given  in  Eqs  (6.48)  and  (6.49)  which 
can  be  written  in  the  form 


X(t)  = 1 + 


(t  + C)  » 


c = real  number  ^ 0 


(6.50) 


Theorem  6.5.  A sufficient  condition  that  Eq  (6.34)  have  a solution 
is  that  the  equation 

A 

[A  - Xl]X(X,t)  - Y (X , t) [B  - XI]  = C + YDYEY  + ^ (6.51) 

at 

have  a solution  X(X,t),  Y(X,t)  where  Eqs  (6.36),  (6.37),  and 

Y (t)  = YQ  + AYjt  + A2Y2t2  + AY3t3  + • • • (6.52) 


hold. 

Proof.  Rewriting  Eq  (6.51)  using  Eqs  (6.36)  and  (6.37)  produces  the 
following  equation: 

[A  - XI] [X  + tX  X + t2X  X2  + t3X-X3  +...]_ 

. (6.53) 

[Yq  + tY^^X  + t2Y2X2  + t3Y3X3  + • • • ][B  - XI]  = C + YDYEY  + ^ 

Equating  like  powers  of  X and  multiplying  the  following  equations  on  the 
2 

left  by  I,  A,  A , . . . gives  the  following  set  of  equations: 


AXq 

-YoB 

= C + YDYEY  + 

a2xl 

1 

£ 

o 

- AYXB  + AYq 

= 0 

a3x2 

- A2X1 

- a2y2b  + a2yl 

= 0 
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] - 


] - 


(6.54) 


(6.55) 


[Y_  + AY  t + A2Y„t2  + • • • + APY  tP  + • • • ]B  = C + YDYEY  + (6.56) 

U 1 L p at 


and  finally, 


AY (t)  - Y (t)B  * C + Y(t)DY(t)EY(t)  + 

at 


(6.57) 


The  following  three  examples  illustrate  the  techniques  presented  in 
Theorems  6.2  and  6.3  for  the  algebraic  matrix  Riccati  equation.  All 
three  present  solutions  to  the  equation 


AX  - XB  = C + XDXEX 


(6.11) 


using  increasing  powers  of  X in  the  intermediate  solution  matrices. 
Example  6.3.  If 
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B = 


then  the  following  exist: 


and  the  following  equation  is  satisfied: 


[A  - XI]X(X)  - Y(X)  [B  - XI]  = C + XDXEX 


Also,  X satisfies 


then  the  following  exist: 


(A+l) 2 l" 

*1  1 

2 0‘ 

X(A)  = 

? 

= 

+ A 

0 X +1 

.o  1 

0 0 

1 0 
0 1 


= XQ  + AXX  + A X2 


Y(A)  = 


’x2+l  -2A+5  * 

"l  5' 

* 0 -2 

O 

= 

4*  A 

-A+3  A -2A+6_ 

3 6. 

-1  -2_ 

1 0 
0 1 


" Y0  + AY1  + X Y2 


(6.64) 
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Now 


A 

9 

"l 

l" 

'2 

o' 

-1  -1 

X = X.  + X,B 

+ 

X„B  = 

4* 

JL 

Z 

_o 

1_ 

.0 

0. 

-1  -2_ 

M1 

°i 

“ 

-i  -l 

-1 

-i 

'l 

2 

o i 

_ 

-1  -2_ 

-1 

-2 

— 

.3 

6 

0 

l 

5" 

1 

l' 

"7 

o 

Y = Yn  + AY 

+ 

X Y = 

+ 

z 

.3 

6_ 

_0 

0_ 

-1  -2J 

1 1 

r1 

"l 

o' 

'l 

2 

-0 

o' 

o 

1 

.0 

1_ 

_3 

6 

and  the  following  equation  is  satisfied: 

[A  - XI]X(X)  - Y (X) [B  - XI]  = C + XDXEX 


i-A  i 

(A +1)2  1 " 

”x2+l  -2\+5  " 

"-1-A  -1  ' 

. 0 -A. 

0 x2+i_ 

_-X+3  X2-2X+6_ 

. "I  -2-X_ 

' -35  -71 

1 2 

'l  1 

1 2 

"i  o' 

'l  2' 

+ 

-117  -237_ 

_3  6_ 

.1  °. 

.3  6_ 

.0  i 

.3  6. 

'7 

13" 

*7 

13" 

9 

15. 

.9 

15. 

(6.65) 


(6.12) 


(6.60) 


Also,  X satisfies  Eq  (6.20): 


while  Y satisfies  Eq  (6.27) 


then  the  following  exist 


A +2A+1  1 


1-X  0 


\3+2X+l 


1 


x -JL 


x2+l 


rX3-2X2+4X-5  -X2+4X-11 


2 

-A  +X-3 


X2-X+2 


36 

-66 

-22 

-3 

|-2  -8' 

'l  o' 

-2  -8“ 

ri  o] 

P2  -81 

i 

o 

_1  1_ 

i 

o 

H-* 

1 

o 

ii£_ 

-1-X  -1 

0 -2-X 

(6.71) 


-4 

-26" 

-4 

-26" 

.-2 

-2_ 

-2 

-2_ 

Also,  X satisfies  Eq  (6.20): 


1 0 
1 0 


-2  -8 
0 5 


-2 

-8' 

-1 

-1 

_ 

' 36 

-66' 

+ 

-40 

40' 

0 

5_ 

_ 0 

-2_ 

.“22 

-3. 

. 20 

5_ 

-4 

-26] 

-4 

-26' 

-2 

2. 

O 
~~  c. 

2_ 

(6.72) 


while  Y satisfies  Eq  (6.27). 

Example  6.6.  This  example  demonstrates  the  techniques  in  Theorem 
6.4  for  the  matrix  differential  equation 


AX  - XB  = C + XDX  - 

at 


(6.73) 


and  the  following  equation  is  satisfied; 

[A  - Xi]X(X,t)  - Y (X , t) [B  - XI]  = C + XDX  - 


Also,  X satisfies 

AX  - XB  = C + XDX  - ~ 


f2t  t+f|  f2t  t+fl 


CL  ID. 


l 


, 


I ' 


VII.  A New  Method  for  Computing  Matrix  Inverses 


A generalization  of  the  classical  Newton-Raphson  method  of  finding 
zeros  of  functions  of  a single  variable  is  extended  in  this  chapter  to 
the  case  of  finding  zeros  of  functions  of  a matrix  variable. 

In  the  classical  problem  of  finding  zeros  of  a function  f(x)  by  the 
Newton-Raphson  method  the  following  algorithm  is  used: 


Xk+1  ~ Xk 


£(V 


:'<V  • 


k = 0,  1,  2,  . , . 


(7.1) 


to  produce  a sequence  {xk+^}  which  converges  to  a zero  (x*)  of  f(x)  = 0. 


Care  must  be  taken  in  choosing  an  initial  x^  so  that  the  above  sequence 


{x^+^}  actually  converges  to  a finite  number,  x*.  If  the  algorithm  is 


rewritten  as 


xfc+1  = x^(2  - axfc) , a = real  number,  a / 0 


(7.2) 


then  the  sequence  {x  - } converges  to  — if  and  only  if  x_  is  chosen  so 


that  |l  - 3Xq|  < 1 (Ref  32:30). 


If  f (x)  is  chosen  as  follows: 


f (x)  = x ^ - a 

(7.3) 

then 

f ' (x)  = = -x  2 

x 

(7.4) 

and  the  classical  Newton-Raphson  algorithm  may  be  written 


Xk+1  " Xk  f ’ (xk) 


f(xk) 
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1 

— - a 
Xk 


= x + x ( a) 

k k vx,  1 
k 


= xk  + Xk  - x,  a 


(7.5) 


= xk(2  - aXk) 


If  yk  = 1 - ax^,  then  = y^  and  it  is  easily  seen  that  the  condition 
|l  - axg|  < 1 is  a necessary  and  sufficient  condition  that  {xk+^}  con- 
verges to  some  x*  which  is  a zero  of  f (x) , that  is,  f(x*)  = 0. 

Example  7.1.  A one  dimensional  example  illustrates  the  critical 

choice  of  x . To  find  — given  a = 2,  f(x)  = — - a,  and  x = 3 the 
US  X U 

algorithm 


Xk+1  = xk(2  " 2xk) 


(7.6) 


generates  the  sequence  {3,  -12,  -312,  . . . } which  diverges.  However, 
if  Xq  is  chosen  as  1/4  then  the  sequence  (1/4,  3/8,  15/32,  . . . } con- 
verges to  1/2,  which  is  a zero  of  f(x). 

In  the  matrix  case  some  conditions  on  the  initial  choice  of  Xq  are 
necessary  to  guarantee  convergence  of  a sequence  {X^^}  to  X*,  a zero  of 
a matrix  function  F(X),  that  is,  F(X*)  = 0. 

Example  7.2.  A matrix  example  illustrates  the  critical  choice  of  X 


P 11, 


1 -1 


P II, 


(7.7) 


If  the  following  algorithm,  which  is  established  later,  is  used  for  the 


matrix  case,  namely, 
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xk+l  - xk(21  - xkA) 


(7.8) 


1 -1 


xi=  L J=X2  = X3 


(7.9) 


However,  for 


'2  0 
0 3 

0 -4 
0 -3_ 


(7.10) 


(7.11) 


0 -20 


0 -15 


(7.12) 


0 -600" 
0 -450 


(7.13) 


In  order  to  develop  the  algorithm  for  the  matrix  case  the  notion  of 
the  Frechet  derivative  of  a matrix  function  is  required.  X and  Y denote 
Banach  algebras  which  are  complete,  normed,  linear  vector  spaces  and  F(*) 
is  an  operator  from  X into  Y.  If  a bounded  linear  operator  L from  X into 
Y exists,  such  that 


F(XQ  + AX)  - F(XQ)  - L(AX)|| 


AX  bo 


(7.14) 


thenF(«)  is  said  to  be  differentiable  at  X^  and  the  bounded  linear  o,  ra- 
tor  F'(Xq)  = L is  called  the  first  Frdchet  derivative  at  X = Xq. 

The  following  results  are  used  to  develop  the  algorithm  for  the 
matrix  case  given  earlier,  and  to  establish  sufficient  conditions  on  the 
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r 


t t 


’ """  - 


choice  of  X^  to  guarantee  convergence  of  {X^+^}  to  some  X*  which  is  a 
..-1 


zero  of  F(X)  = X -A. 

-1 

7.1.  If  F(X)  = X - A is  a mapping  of  a Banach  algebra  X 


Theorem 


* * -1  * -1 
into  a Banach  algebra  Y,  where  for  a given  X € X,  X € Y,  and  (X  - A) 


€ Y,  then  the  Frdchet  derivative  of  F(X)  = X ^ - A is  F’(X)  = -X 


Proof . Using  the  results  of  Lusternik  and  Sobolev  (Ref  33:91),  if  X 
and  AX  are  elements  of  a Banach  algebra  and  if  X has  an  inverse  element 


X \ and  ||  AX  ||  < (||  X 1 ||)  \ then  (X  + AX)  also  has  an  inverse  written 


-1  n. -1 


as 


(X  + AX)  If  ||  AX  ||  < (||  X 1 || ) 1 and  if  X 1 exists,  then,  letting 


r(x)  = x - a, 


F (X  + AX)  - F(X)  = [(X  + AX)-1  - A]  - [X_1  - A] 


(7.15) 


= (X  + AX)  1 - X-1 


(7.16) 


If  X 1 and  V 1 exist,  then  the  following  identity  holds: 


X_1(X  - Y)Y_1  = Y-1  - X 1 


(7.17) 


Using  Eqs  (7.16)  and  (7.17),  and  letting  Y = X + AX, 


F (X  + AX)  - F (X)  = 


X_1(X  - X - AX) (X  + AX)-1 


= X_1 (-AX) (X  + AX)-1 


= -X-1 (AX) (X  + AX)-1 


-X-1(AX)(X  + AX)-1  + X-1(AX)X-1  - X 1(AX)X 


(7.18) 

-1 


the  following  form  of  the  above  is  desired: 


F (X  + AX)  - F (X)  = F’(X)(AX)  + a(X,AX) 


(7.19) 


lim 


a (X  ,AX) 


AX  bo 


AX 


= 0 


(7.20) 


1 
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Grouping  the  first  two  terms  of  the  right  side  of  the  last  of  Eqs  (7.18) 
gives 


F (X  + AX)  - 


= -X_1(AX)[(X  + AX)"1  - X_1]  - X"1(AX)X"1 
= -X~1(AX)X~1(-AX) (X  + AX)'1  - X~1(AX)X~1  (7.21) 

= X_1 (AX)X_1 (AX) (X  + AX)'1  - X'1(AX)X'1 


a(X,AX) 


X_1||  2-|l  AXl|  2-||  (X  + AX)'1! 


= II  x'1 1|  2.  ||  (X  + AX)  ||  • II  AX  ||  + 0 as  ||  AX  ||  - 0 (7.22) 

For  AX  sufficiently  small,  (AX)X  1 = X 1(AX)  and  the  Frechet  derivative 
of  F(X)  is  given  by 


F'(X)  = -X'2  = -(X'1)(X“1) 


(7.23) 


Theorem  7.2  (Kantorovich).  If  for  some  Xq  € DQ,  F'(XQ)  exists,  and 


(i)  I!  [F’(X0)]'-L||  < 6 


(ii)  II  [F’(X0)]'-LF(X0)||  < n 


(7.24) 


(iii)  ||  F ' (X)  - F'(Y)  ||  < k |{  X - Y ||,  for  any  X,  Y ( D. 


where 


h = 8nK  < 1/2 


n*  = X:  H X ' X0 


< t*  = n 
h n 


(7.25) 


(7.26) 


holds.  Now  if  € D^,  then  the  Newton  iterations 


(k+l  ’ *k  - F(Xk) 


(7.27) 


are  well  defined,  remain  in  ft..,  and  converge  to  X*  £ ft.  such  that  F(X*) 


0.  In  addition 
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r/j 


Proof . A proof  is  given  by  Tapia  (Ref  34:389-392). 

Theorem  7.3.  If  F(X)  is  a matrix  function  satisfying  the  hypotheses 
of  Theorems  7.1  and  7.2,  then  the  inverse  of  a n by  n nonsingular  matrix 
A is  given  by  the  limit  of  the  sequence 


.-1, 


or 


’Sc+l  ’ Xk  “ [r'<*k)!  P<XkK  k - 0.  1.  2, 
*1*1  - V21  - V1- 


k - 0,  1,  2,  • • 


(7.29) 


(7.30) 


Proof . By  choosing  F(X)  to  be  X - A,  the  recursion  formula 
follows  from  Eq  (7.27): 

W ■ \ - [F'(VrlF<V 


= Xk  - [-X  2]-1tX-1  - A] 

• \ + *k  - \2a 
= - XkA] 


(7.31) 


Example  7.3.  To  illustrate  the  formula  just  presented  the  following 
matrices  are  used: 


"l 

1 

o' 

‘l 

-1 

o" 

1/2 

0 

o' 

A = 

0 

1 

0 

II 

< 

0 

1 

0 

X 

O 

II 

0 

1 

0 

0 

0 

1 

* 

0 

0 

1 

* 

_ 0 

0 

1_ 

k.  I » 


Now 


<k+i  = V21  - xkA) 


(7.32) 


(7.30) 


where  F(X)  - -X  2 = -(X"1)(X_1)  and  ||  XQ||  = max  |a±J  | , i,j  = 1,  2, 
. . . , n,  and  Eqs  (7.24)  become 
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(i)||  [F'(Xq)]  I!  = 


0 -1  0=6=1 


0 0-1 


(ii)  ||  [F'(X0)]_1F(XC 


-[Xq"2]"1  II 

-X02(X0_1  - A) 


(-X0  + X A) 


-1/2  0 0|  fl/4  0 Olfl  1 Ol 


0 -1  0+0  10010 


0 -1 


0 0 1 0 0 1 


0 -1 


-1/2  0 0|  fl/4  1/4  0 


0-10+0  10 


0 0 1 


-1/4  1/4  0 


0 0 0 


(iii)||  F ' (X)  - F ' (Y)  || 


-x~2  + y“2| 


(Y-1X  1 - X-2)  + (Y  2 - Y_1X_1)  || 

(Y-1  - X_1)X-1 1|  + ||  Y_1(Y_1  - X-1)  |J 
(X-1  - Y-1)X-1 1|  + || Y_1  (X-1  - Y'1)  || 

Y_1(Y  - X)X"1-X"1||  + ||  Y_1(Y_1)  (Y  - X)X_1| 


Y 1||  • ||X  1\\  2- 


Y “ X ||  + 1 1 Y~ 1 1|  2-|i  Y - X | 


II  Y_1 1|  * ||X_1||  2 + ||  Y_1 1|  2-  ||  X_1| 
Y - X\\ 


Therefore,  for 


! -»  way  


VIII.  Conclusion 

The  increasing  use  of  matrix  equations  in  the  engineering  sciences 
has  stimulated  a commensurate  interest  in  the  investigation  of  new  ways 
to  solve  these  equations.  The  original  computational  methods  presented 
in  this  report  can  simplify  calculation  of  solutions  of  certain  matrix 
equations.  These  new  techniques  are  easily  adapted  for  use  by  the  digi- 
tal computer,  in  some  cases  eliminating  the  need  for  lengthy  iteration 
processes.  Several  new  representations  of  solutions  can  possibly  be 
extended  resulting  in  additional  areas  of  their  application.  The  itera- 
tion scheme  presented  in  the  last  chapter  could  form  the  basis  of  another 
technique  for  finding  solutions  of  the  Liapunov  and  Riccati  equations. 
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conditions  on  the  matrices.  Necessary  and  sufficient  conditions  are  established 
for  the  existence  of  a solution  of  AX  = C where  the  elements  of  the  matrices 
belong  to  the  polynomial  domain  of  the  field  of  real  numbers.  Representations 
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Block  19. 

Linear  Differential  Equations 
Nonlinear  Differential  Equations 

Block  20. 

of  solutions  to  the  algebraic  Riccati  equation  are  given  along  with  original 
techniques  for  computation  of  solutions  and  conditions  for  uniqueness.  Suffi- 
cient conditions  for  the  existence  of  the  generalized  nonlinear  algebraic  and 
differential  Riccati  equations  are  established.  The  classical  Newton-Raphson 
method  of  finding  zeros  of  functions  of  a single  variable  is  extended  to  the 
case  of  finding  zeros  of  functions  of  a matrix  variable,  including  determining 
the  inverse  of  a matrix.  An  extensive  bibliography  includes  a comprehensive 
listing  of  articles  which  provide  historical  and  developmental  background  on 
the  matrix  equations  considered  in  this  thesis. 
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